DRUG TARGET ISOGENES: 
POLYMORPHISMS IN THE OSTEOCLASTOGENESIS INHIBITORY FACTOR GENE 



RELATED APPLICATIONS 
This application is a continuation-in-part of, and claims priority to, U.S. Provisional Application 
Serial No. 60/143,020 filed July 9, 1999. 

FIELD OF THE INVENTION 

This invention relates to variation in genes that encode pharmaceutically important proteins. In 
particular, this invention provides genetic variants of the human Osteoclastogenesis Inhibitory Factor 
(TNFRSF11B) gene and methods for identifying which variant(s) of this gene is/are possessed by an 
individual. 

BACKGROUND OF THE INVENTION 

Current methods for identifying pharmaceuticals to treat disease often start by identifying, 
cloning, and expressing an important target protein related to the disease. A determination of whether an 
agonist or antagonist is needed to produce an effect that may benefit a patient with the disease is then 
made. Then, vast numbers of compounds are screened against the target protein to find new potential 
drugs. The desired outcome of this process is a drug that is specific for the target, thereby reducing the 
incidence of the undesired side effects usually caused by a compound's activity at non-intended targets. 

What this approach fails to consider, however, is that natural variability exists in any and every 
population with respect to a particular protein. A target protein currently used to screen drugs typically is 
expressed by a gene cloned from an individual who was arbitrarily selected. However, the nucleotide 
sequence of a particular gene may vary tremendously among individuals. Subtle alteration(s) in the 
primary nucleotide sequence of a gene encoding a target protein may be manifested as significant 
variation in expression of or in the structure and/or function of the protein. Such alterations may explain 
the relatively high degree of uncertainty inherent in treatment of individuals with drugs whose design is 
based upon a single representative example of the target. For example, it is well-established that some 
classes of drugs frequently have lower efficacy in some individuals than others, which means such 
individuals and their physicians must weigh the possible benefit of a larger dosage against a greater risk 
of side effects. In addition, variable information on the biological function or effects of a particular 
protein may be due to different scientists unknowingly studying different isoforms of the gene encoding 
the protein. Thus, information on the type and frequency of genomic variation that exists for 
pharmaceutically important proteins would be useful. 

The organization of single nucleotide variations (polymorphisms) in the primary sequence of a 
gene into one of the limited number of combinations that exist as units of inheritance is termed a 
haplotype. Each haplotype therefore contains significantly more information than individual unorganized 
polymorphisms. Haplotypes provide an accurate measurement of the genomic variation in the two 
chromosomes of an individual. 

It is well-established that many diseases are associated with specific variations in gene sequences. 
However, while there are examples in which individual polymorphisms act as genetic markers for a 
particular phenotype, in other cases an individual polymorphism may be found in a variety of genomic 
backgrounds and therefore shows no definitive coupling between the polymorphism and the causative site 
for the phenotype (Clark AG et al. 1998 Am J Hum Genet 63:595-612; Ulbrecht M et al. 2000 Am J 
Respir Crit Care Med 161:469-74). In addition, the marker may be predictive in some populations, but 
not in other populations (Clark AG et al. 1998 supra). In these instances, a haplotype will provide a 
superior genetic marker for the phenotype (Clark AG et al. 1998 supra; Ulbrecht M et al. 2000, supra; 
Ruano G & Stephens JC Gen Eng News 19 (21), December 1999). 

Analysis of the association between each observed haplotype and a particular phenotype permits 
ranking of each haplotype by its statistical power of prediction for the phenotype. Haplotypes found to be 
strongly associated with the phenotype can then have that positive association confirmed by alternative 
methods to minimize false positives. For a gene suspected to be associated with a particular phenotype, if 
no observed haplotypes for that gene show association with the phenotype of interest, then it may be 
inferred that variation in the gene has little, if any, involvement with that phenotype (Ruano & Stephens 
1999, supra). Thus, information on the observed haplotypes and their frequency of occurrence in various 
population groups will be useful in a variety of research and clinical applications. 
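
By way of illustration only, the ranking of haplotypes by strength of association with a phenotype 
can be sketched as follows. The sketch assumes a simple carrier/non-carrier by trait/no-trait 
contingency table scored with a Pearson chi-square statistic; this statistic is chosen for illustration 
and is not stated in the text to be the measure actually used. The haplotype names and all counts are 
hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class HaplotypeCounts:
    """Counts of individuals for one haplotype (all values hypothetical)."""
    name: str
    carriers_with_trait: int
    carriers_without_trait: int
    noncarriers_with_trait: int
    noncarriers_without_trait: int


def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        # A row or column is empty, so no association can be measured.
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def association_score(h: HaplotypeCounts) -> float:
    """Score one haplotype by its chi-square statistic against the trait."""
    return chi_square_2x2(
        h.carriers_with_trait,
        h.carriers_without_trait,
        h.noncarriers_with_trait,
        h.noncarriers_without_trait,
    )


def rank_haplotypes(observed: List[HaplotypeCounts]) -> List[HaplotypeCounts]:
    """Return haplotypes ordered from strongest to weakest association."""
    return sorted(observed, key=association_score, reverse=True)


# Hypothetical counts, for illustration only.
EXAMPLE_COUNTS = [
    HaplotypeCounts("hap_B", 25, 25, 25, 25),
    HaplotypeCounts("hap_A", 30, 10, 20, 40),
]

if __name__ == "__main__":
    for h in rank_haplotypes(EXAMPLE_COUNTS):
        print(h.name, round(association_score(h), 3))
```

A haplotype that ranks highly in such a screen would still need its association confirmed by an 
alternative method, as noted above, since a single test statistic does not control for false positives.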

One possible drug target for the treatment of osteoporosis and other disorders caused by 
abnormal osteoclast recruitment and function is the Osteoclastogenesis Inhibitory Factor (TNFRSF11B) 
gene or its encoded product. TNFRSF11B is also known as Osteoprotegerin (OPG) (Simonet et al., 
Cell 89(2):309-319, 1997) and TNF receptor-like molecule 1 (TR1) (Takahashi et al., Endocr. Rev. 
20:345-357, 1999), and is also sometimes referred to by the symbols OCDF or OIF. TNFRSF11B is a 
soluble member of the tumor necrosis factor receptor (TNFR) superfamily and binds to at least one 
TNF-related cytokine, RANKL (also known as TRANCE, OPGL, ODF). RANKL stimulates differentiation of 
osteoclasts, which are the bone-resorbing cells in the body, from osteoclast precursors by binding to 
RANK, a TNF receptor family member (Tsuda E. & Higashio K 1998 Nippon Rinsho 56:1435-9). In 
vitro studies have shown that TNFRSF11B neutralizes RANKL-induced osteoclastogenesis by binding to 
RANKL, suggesting that TNFRSF11B is actually a secreted "decoy" receptor for RANKL that blocks 
initiation of a critical RANK-RANKL signal transduction pathway within osteoclast precursor cells 
(Takahashi et al., Biochem. Biophys. Res. Commun. 256:449-455, 1999). As a result of this blocking 
action, the number of mature osteoclasts is decreased. In vivo, TNFRSF11B increases bone mineral 
density and bone volume in normal rats (Nippon Rinsho 56:1435-1439, 1998), and also exhibits 
hypocalcemic effects in normal mice and in hypercalcemic nude mice carrying tumors associated with 
humoral hypercalcemia of malignancy (Bone 23:495-498, 1998). Also, it was reported that TNFRSF11B 
knock-out mice developed severe osteoporosis due to enhanced osteoclastogenesis when they grew to be 
adults (Mizuno et al., Biochem. Biophys. Res. Commun. 247:610-615, 1998). Thus, along with RANKL 
and RANK, TNFRSF11B is one of the key molecules that regulate osteoclast recruitment and function, 
and as such, an understanding of variation in the TNFRSF11B gene should be useful in developing new 
therapies for metabolic diseases caused by abnormal osteoclast recruitment and function such as 
osteopetrosis, osteoporosis, metastatic bone disease, Paget's disease, rheumatoid arthritis, and periodontal 
bone disease. 

The human, mouse and rat TNFRSF11B proteins are all 401 amino acids in length, with human 
and rat TNFRSF11B having 94% amino acid (a.a.) sequence identity (Akatsu et al. 1998 Bone 23:495-8). 
Within the TNFR superfamily, TNFRSF11B is most similar to TNFRII and CD40, in that this secreted 
protein has no transmembrane segment, and circulates as a disulfide-linked homodimer. TNFRSF11B 
has four cysteine-rich domains and two death domain homologous regions present in tandem at the 
C-terminal portion of the protein (Morinaga et al., Eur. J. Biochem. 254:685-691, 1998; Mizuno et al., Gene 
214:339-343, 1998). 

Human TNFRSF11B is encoded by a single-copy gene having five exons and four introns which 
span 29 kilobases (kb) on chromosome 8q24 of the human genome (Morinaga et al., Eur. J. Biochem. 
254:685-691, 1998). Although the full genomic sequence has not been published, a reference sequence 
for this gene comprises the partial sequences shown in Fig. 1 (GenBank Accession No. AB008821.1; 
SEQ ID NO:1), which includes the 5' untranslated region and the coding sequence for exon 1, and Fig. 2 
(GenBank Accession No. AB008822.1; SEQ ID NO:2), which includes the coding sequences for exons 2-5 
as well as the 3' untranslated region. Reference sequences for a TNFRSF11B cDNA and protein are 
shown in Figures 3 (SEQ ID NO:3) and 4 (SEQ ID NO:4), respectively. 

Expression regulatory elements identified in this nearly complete genomic sequence include (1) a 
major transcription initiation site located 67 nucleotides (nt) upstream of the initiation ATG codon, (2) a 
translation-termination codon in exon 5, and (3) a typical poly(A)-addition signal located 173 nt 
downstream of the translation-termination codon (Morinaga et al., supra). Two TNFRSF11B transcripts 
of 4.2 kb and 6.5 kb have been detected in IMR-90 cells, with the shorter transcript containing the 3' half 
of intron 2 and the longer transcript containing all of intron 2 (Morinaga et al., supra). 

One group has reported finding single nucleotide polymorphisms (SNPs) at positions 9 and 22 of 
exon 1, which correspond to positions 1181 and 1194 of GENBANK Accession #AB008821.1 (Yasuda et 
al., Endocrinology 139:1329-1337, 1997; GENBANK Acc. No. E15271.1). The SNPs at these sites 
would result in variation in the encoded amino acid sequence at position 3 (lysine or asparagine) and/or 
position 8 (alanine or serine), respectively, depending on the particular combination of nucleotides found 
at these polymorphic sites in one of the two copies of the TNFRSF11B gene from an individual. 

Because of the potential for polymorphisms in the TNFRSF11B gene to affect the expression and 
function of the encoded protein, it would be useful to determine whether additional polymorphisms exist 
in the TNFRSF11B gene, as well as how such polymorphisms are combined in different copies of the 
gene. Such information would be useful for studying the biological function of TNFRSF11B as well as in 
identifying drugs targeting this protein for the treatment of disorders related to its abnormal expression or 
function. 



SUMMARY OF THE INVENTION 

Accordingly, the inventors herein have discovered 24 novel polymorphic sites in the 
TNFRSF11B gene. These polymorphic sites (PS) correspond to the following nucleotide positions in the 
indicated GenBank Accession Number: 491 (PS1), 676 (PS2), 889 (PS3), 916 (PS4), 950 (PS5), 1217 
(PS7), 1294 (PS8) and 1390 (PS9) in AB008821; 505 (PS10), 668 (PS11), 4397 (PS12), 4501 (PS13), 
6601 (PS14), 6824 (PS15), 6839 (PS16), 6845 (PS17), 6893 (PS18), 6950 (PS19), 8258 (PS20), 8391 
(PS21), 8622 (PS22), 8694 (PS23), 8955 (PS24) and 9003 (PS25) in AB008822. The polymorphisms at 
these sites are guanine or adenine at PS1, guanine or thymine at PS2, cytosine or thymine at PS3, guanine 
or thymine at PS4, thymine or cytosine at PS5, cytosine or thymine at PS7, guanine or adenine at PS8, 
cytosine or adenine at PS9, cytosine or thymine at PS10, thymine or cytosine at PS11, thymine or 
cytosine at PS12, cytosine or thymine at PS13, guanine or adenine at PS14, cytosine or thymine at PS15, 
guanine or adenine at PS16, adenine or guanine at PS17, adenine or guanine at PS18, adenine or cytosine 
at PS19, guanine or adenine at PS20, thymine or cytosine at PS21, guanine or thymine at PS22, guanine 
or adenine at PS23, adenine or thymine at PS24, and thymine or cytosine at PS25. In addition, the 
inventors confirmed the presence of the previously reported polymorphic site at nucleotide 1181 (PS6) in 
Figure 1. The polymorphic site at 1194 was not detected in the experiments described herein. It is 
believed that TNFRSF11B-encoding polynucleotides containing one or more of the novel polymorphic 
sites reported herein will be useful in studying the expression and biological function of TNFRSF11B, as 
well as in developing drugs targeting this protein. In addition, information on the combinations of 
polymorphisms in the TNFRSF11B gene may have diagnostic and forensic applications. 
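
For convenience, the 24 novel polymorphic sites listed above can be collected into a lookup table. 
The following is a minimal sketch that assumes the first nucleotide named for each site is the 
reference allele and the second is the variant allele, which is consistent with the variant alleles 
recited for the isolated polynucleotide of the invention. The names NOVEL_SITES and 
is_variant_allele are illustrative and are not part of the disclosure.

```python
# Novel polymorphic sites: name -> (GenBank accession, position, reference, variant).
# Positions and alleles are transcribed from the summary; PS6 (nucleotide 1181 of
# AB008821) is omitted because it was previously reported rather than novel.
NOVEL_SITES = {
    "PS1": ("AB008821", 491, "G", "A"),
    "PS2": ("AB008821", 676, "G", "T"),
    "PS3": ("AB008821", 889, "C", "T"),
    "PS4": ("AB008821", 916, "G", "T"),
    "PS5": ("AB008821", 950, "T", "C"),
    "PS7": ("AB008821", 1217, "C", "T"),
    "PS8": ("AB008821", 1294, "G", "A"),
    "PS9": ("AB008821", 1390, "C", "A"),
    "PS10": ("AB008822", 505, "C", "T"),
    "PS11": ("AB008822", 668, "T", "C"),
    "PS12": ("AB008822", 4397, "T", "C"),
    "PS13": ("AB008822", 4501, "C", "T"),
    "PS14": ("AB008822", 6601, "G", "A"),
    "PS15": ("AB008822", 6824, "C", "T"),
    "PS16": ("AB008822", 6839, "G", "A"),
    "PS17": ("AB008822", 6845, "A", "G"),
    "PS18": ("AB008822", 6893, "A", "G"),
    "PS19": ("AB008822", 6950, "A", "C"),
    "PS20": ("AB008822", 8258, "G", "A"),
    "PS21": ("AB008822", 8391, "T", "C"),
    "PS22": ("AB008822", 8622, "G", "T"),
    "PS23": ("AB008822", 8694, "G", "A"),
    "PS24": ("AB008822", 8955, "A", "T"),
    "PS25": ("AB008822", 9003, "T", "C"),
}


def is_variant_allele(site: str, base: str) -> bool:
    """Return True if `base` is the variant (non-reference) allele at `site`."""
    _accession, _position, reference, variant = NOVEL_SITES[site]
    base = base.upper()
    if base not in (reference, variant):
        raise ValueError(f"{base!r} is not one of the alleles recorded for {site}")
    return base == variant


if __name__ == "__main__":
    in_first = sum(1 for acc, *_ in NOVEL_SITES.values() if acc == "AB008821")
    print(len(NOVEL_SITES), "novel sites;", in_first, "in AB008821")
```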

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a 
nucleotide sequence which is a polymorphic variant of a reference sequence for the TNFRSF11B gene or 
a fragment thereof. The reference sequence comprises SEQ ID NOS:1-2 and the polymorphic variant 
comprises at least one polymorphism selected from the group consisting of adenine at PS1, thymine at 
PS2, thymine at PS3, thymine at PS4, cytosine at PS5, thymine at PS7, adenine at PS8, adenine at PS9, 
thymine at PS10, cytosine at PS11, cytosine at PS12, thymine at PS13, adenine at PS14, thymine at PS15, 
adenine at PS16, guanine at PS17, guanine at PS18, cytosine at PS19, adenine at PS20, cytosine at PS21, 
thymine at PS22, adenine at PS23, thymine at PS24 and cytosine at PS25. In a preferred embodiment, the 
polymorphic variant comprises an additional polymorphism of cytosine at PS6. A particularly preferred 
polymorphic variant is a naturally-occurring isoform (also referred to herein as an "isogene") of the 
TNFRSF11B gene. A TNFRSF11B isogene of the invention comprises guanine or adenine at PS1, 
guanine or thymine at PS2, cytosine or thymine at PS3, guanine or thymine at PS4, thymine or cytosine at 
PS5, cytosine or thymine at PS7, guanine or adenine at PS8, cytosine or adenine at PS9, cytosine or 
thymine at PS10, thymine or cytosine at PS11, thymine or cytosine at PS12, cytosine or thymine at PS13, 
guanine or adenine at PS14, cytosine or thymine at PS15, guanine or adenine at PS16, adenine or guanine 
at PS17, adenine or guanine at PS18, adenine or cytosine at PS19, guanine or adenine at PS20, thymine or 
cytosine at PS21, guanine or thymine at PS22, guanine or adenine at PS23, adenine or thymine at PS24, 
and thymine or cytosine at PS25. The invention also provides a collection of TNFRSF11B isogenes, 
referred to herein as a TNFRSF11B genome anthology. 

A TNFRSF11B isogene may be defined by the combination and order of these polymorphisms in 
the isogene, which is referred to herein as a TNFRSF11B haplotype. Thus, the invention also provides 
data on the number of different TNFRSF11B haplotypes found in the reference populations used in the 
experiments described herein. This haplotype data is useful in methods for deriving a TNFRSF11B 
haplotype from an individual's genotype for the TNFRSF11B gene and for determining an association 
between a TNFRSF11B haplotype and a particular trait. 
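
One simple way in which known haplotype data can help derive haplotypes from a genotype is sketched 
below. The sketch enumerates every haplotype pair consistent with an unphased genotype and then keeps 
only those pairs in which both haplotypes have already been observed in a reference population. This 
brute-force enumeration is an illustrative assumption and is not asserted to be the derivation method 
of the Examples; the three-site genotype and the set of known haplotypes are hypothetical.

```python
from itertools import product
from typing import Iterable, List, Sequence, Tuple

HaplotypePair = Tuple[str, str]


def candidate_haplotype_pairs(genotype: Sequence[str]) -> List[HaplotypePair]:
    """Enumerate haplotype pairs consistent with an unphased genotype.

    `genotype` holds one two-letter nucleotide pair per polymorphic site,
    in 5' to 3' order, e.g. ["GA", "CC", "TC"].
    """
    if any(len(pair) != 2 for pair in genotype):
        raise ValueError("each site must be given as a two-letter nucleotide pair")
    het_sites = [i for i, pair in enumerate(genotype) if pair[0] != pair[1]]
    pairs = set()
    # Try both orientations at every heterozygous site; mirrored results
    # collapse because each pair is stored in sorted order.
    for choice in product((0, 1), repeat=len(het_sites)):
        orientation = dict(zip(het_sites, choice))
        hap1, hap2 = [], []
        for i, pair in enumerate(genotype):
            k = orientation.get(i, 0)
            hap1.append(pair[k])
            hap2.append(pair[1 - k])
        pairs.add(tuple(sorted(("".join(hap1), "".join(hap2)))))
    return sorted(pairs)


def resolve_with_known(genotype: Sequence[str],
                       known_haplotypes: Iterable[str]) -> List[HaplotypePair]:
    """Keep only candidate pairs whose two haplotypes are both already known."""
    known = set(known_haplotypes)
    return [p for p in candidate_haplotype_pairs(genotype)
            if p[0] in known and p[1] in known]


if __name__ == "__main__":
    genotype = ["GA", "CC", "TC"]          # hypothetical three-site genotype
    known = {"GCT", "ACC", "GCC"}          # hypothetical observed haplotypes
    print(candidate_haplotype_pairs(genotype))
    print(resolve_with_known(genotype, known))
```

With n heterozygous sites there are 2^(n-1) distinct candidate pairs (one when n is zero), so the 
number of observed haplotypes in a population is what makes the ambiguity tractable in practice.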

In another embodiment, the invention provides a polynucleotide comprising a polymorphic 
variant of a reference sequence for a TNFRSF11B cDNA or a fragment thereof. The reference sequence 
comprises SEQ ID NO:3 (Fig. 3) and the polymorphic cDNA comprises at least one polymorphism 
selected from the group consisting of thymine at a position corresponding to nucleotide 699, adenine at a 
position corresponding to nucleotide 714, guanine at a position corresponding to nucleotide 720, guanine 
at a position corresponding to nucleotide 768, adenine at a position corresponding to nucleotide 841, 
thymine at a position corresponding to nucleotide 1102 and cytosine at a position corresponding to 
nucleotide 1150. In a preferred embodiment, the polymorphic variant comprises an additional 
polymorphism of cytosine at a position corresponding to nucleotide 9 in Figure 3. 

Polynucleotides complementary to these TNFRSF11B genomic and cDNA variants are also 
provided by the invention. 

In other embodiments, the invention provides a recombinant expression vector comprising one of 
the polymorphic genomic variants operably linked to expression regulatory elements as well as a 
recombinant host cell transformed or transfected with the expression vector. The recombinant vector and 
host cell may be used to express TNFRSF11B for protein structure analysis and drug binding studies. 

In yet another embodiment, the invention provides a polypeptide comprising a polymorphic 
variant of a reference amino acid sequence for the TNFRSF11B protein. The reference amino acid 
sequence comprises SEQ ID NO:4 (Fig. 4) and the polymorphic variant comprises at least one variant 
amino acid selected from the group consisting of methionine at a position corresponding to amino acid 
position 240, methionine at a position corresponding to amino acid position 281, and serine at a position 
corresponding to amino acid 368. In some embodiments, the polymorphic variant also comprises 
asparagine at a position corresponding to amino acid 3 in Figure 4. A polymorphic variant of 
TNFRSF11B is useful in studying the effect of the variation on the biological activity of TNFRSF11B as 
well as studying the binding affinity of candidate drugs targeting TNFRSF11B for the treatment of 
osteoporosis and other disorders caused by abnormal osteoclast recruitment and function. 
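
The correspondence between the cDNA positions and the amino acid positions recited above can be 
checked with simple codon arithmetic. The sketch below assumes that nucleotide 1 of Figure 3 is the 
first base of the start codon, an assumption that is consistent with nucleotide 9 of Figure 3 
corresponding to amino acid 3 of Figure 4. The function name codon_position is illustrative.

```python
from typing import Tuple

# cDNA positions (Figure 3 numbering) named in the summary, including the
# previously reported site at nucleotide 9.
CDNA_SITES = (9, 699, 714, 720, 768, 841, 1102, 1150)

# Amino acid positions at which the summary recites a variant residue.
VARIANT_RESIDUES = {3, 240, 281, 368}


def codon_position(nucleotide: int) -> Tuple[int, int]:
    """Map a 1-based coding nucleotide to (amino acid number, position in codon)."""
    if nucleotide < 1:
        raise ValueError("coding positions are 1-based")
    return (nucleotide - 1) // 3 + 1, (nucleotide - 1) % 3 + 1


if __name__ == "__main__":
    for nt in CDNA_SITES:
        residue, offset = codon_position(nt)
        note = "variant residue recited" if residue in VARIANT_RESIDUES else "no variant residue recited"
        print(f"nt {nt:>4} -> amino acid {residue:>3}, codon position {offset} ({note})")
```

Under this numbering, nucleotides 9, 720, 841 and 1102 fall in the codons for amino acids 3, 240, 281 
and 368, respectively, which are the four positions at which a variant amino acid is recited. The 
remaining cDNA sites fall in codons for which no variant amino acid is recited.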

The present invention also provides antibodies that recognize and bind to the above polymorphic 
TNFRSF11B protein variant. Such antibodies can be utilized in a variety of diagnostic and prognostic 
formats and therapeutic methods. 

In other embodiments, the invention provides methods, compositions, and kits for haplotyping 
and/or genotyping the TNFRSF11B gene in an individual. The methods involve identifying the 
nucleotide or nucleotide pair present at one or more polymorphic sites selected from PS1-5, PS7-25 in one 
or both copies of the TNFRSF11B gene from the individual. The compositions contain oligonucleotide 
probes and primers designed to specifically hybridize to one or more target regions containing, or that are 
adjacent to, a polymorphic site. The methods and compositions for establishing the genotype or 
haplotype of an individual at the novel polymorphic sites described herein are useful for studying the 
effect of the polymorphisms in the etiology of diseases affected by the expression and function of the 
TNFRSF11B protein, studying the efficacy of drugs targeting TNFRSF11B, predicting individual 
susceptibility to diseases affected by the expression and function of the TNFRSF11B protein and 
predicting individual responsiveness to drugs targeting TNFRSF11B. 

In yet another embodiment, the invention provides a method for identifying an association 
between a genotype or haplotype and a trait. In preferred embodiments, the trait is susceptibility to a 
disease, severity of a disease, the staging of a disease or response to a drug. Such methods have 
applicability in developing diagnostic tests and therapeutic treatments for osteoporosis and other disorders 
caused by abnormal osteoclast recruitment and function. 

The present invention also provides transgenic animals comprising one of the TNFRSF11B 
genomic polymorphic variants described herein and methods for producing such animals. The transgenic 
animals are useful for studying expression of the TNFRSF11B isogenes in vivo, for in vivo screening and 
testing of drugs targeted against TNFRSF11B protein, and for testing the efficacy of therapeutic agents 
and compounds for osteoporosis and other disorders caused by abnormal osteoclast recruitment and 
function in a biological system. 

The present invention also provides a computer system for storing and displaying polymorphism 
data determined for the TNFRSF11B gene. The computer system comprises a computer processing unit; 
a display; and a database containing the polymorphism data. The polymorphism data includes the 
polymorphisms, the genotypes and the haplotypes identified for the TNFRSF11B gene in a reference 
population. In a preferred embodiment, the computer system is capable of producing a display showing 
TNFRSF11B haplotypes organized according to their evolutionary relationships. 
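
A database of this kind could take many forms. The following is a minimal sketch, assuming a 
relational layout held in an in-memory SQLite database; the table names, column names and function 
names are illustrative, and the four individuals and their genotypes at PS1 are hypothetical. Only the 
site records for PS1 and PS2 are taken from the polymorphic sites listed above.

```python
import sqlite3


def build_demo_database() -> sqlite3.Connection:
    """Create an in-memory database with one table of sites and one of genotypes."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE site (
            name      TEXT PRIMARY KEY,
            accession TEXT NOT NULL,
            position  INTEGER NOT NULL,
            ref       TEXT NOT NULL,
            alt       TEXT NOT NULL
        );
        CREATE TABLE genotype (
            individual TEXT NOT NULL,
            site       TEXT NOT NULL REFERENCES site(name),
            pair       TEXT NOT NULL   -- unphased nucleotide pair, e.g. 'GA'
        );
        """
    )
    conn.executemany(
        "INSERT INTO site VALUES (?, ?, ?, ?, ?)",
        [("PS1", "AB008821", 491, "G", "A"),
         ("PS2", "AB008821", 676, "G", "T")],
    )
    # Hypothetical individuals and genotypes, for illustration only.
    conn.executemany(
        "INSERT INTO genotype VALUES (?, ?, ?)",
        [("ind1", "PS1", "GG"), ("ind2", "PS1", "GA"),
         ("ind3", "PS1", "AA"), ("ind4", "PS1", "GG")],
    )
    conn.commit()
    return conn


def variant_allele_frequency(conn: sqlite3.Connection, site: str) -> float:
    """Fraction of genotyped chromosomes carrying the variant allele at `site`."""
    row = conn.execute("SELECT alt FROM site WHERE name = ?", (site,)).fetchone()
    if row is None:
        raise KeyError(site)
    alleles = "".join(
        pair for (pair,) in conn.execute(
            "SELECT pair FROM genotype WHERE site = ?", (site,))
    )
    if not alleles:
        raise ValueError(f"no genotypes stored for {site}")
    return alleles.count(row[0]) / len(alleles)


if __name__ == "__main__":
    db = build_demo_database()
    print(variant_allele_frequency(db, "PS1"))
```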

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a partial reference sequence for the TNFRSF11B gene (GenBank Version 
Number AB008821.1; contiguous lines; SEQ ID NO:1), with the underline indicating the start codon, 
shading indicating the reference coding sequence, and bold nucleotides indicating the polymorphic sites 
and polymorphisms identified by Applicants in a reference population. 

Figure 2 illustrates a partial reference sequence for the TNFRSF11B gene (GenBank Version 
Number AB008822.1; contiguous lines; SEQ ID NO:2), with the underline indicating the stop codon, 
shading indicating the reference coding sequence, and bold nucleotides indicating the polymorphic sites 
and polymorphisms identified by Applicants in a reference population. 

Figure 3 illustrates a reference sequence for the TNFRSF11B coding sequence (GENBANK 
ACC# AB002146; contiguous lines; SEQ ID NO:3) with underlines indicating the start and stop codons, 
and bold nucleotides indicating the polymorphic sites and polymorphisms identified by Applicants in a 
reference population. 

Figure 4 illustrates a reference sequence for the TNFRSF11B protein (GENBANK ACC # 
BAA25910; contiguous lines; SEQ ID NO:4) with the bold amino acids indicating the amino acid 
variations caused by the polymorphisms of Figure 3. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is based on the discovery of novel variants of the TNFRSF11B gene. As 
described in more detail below, the inventors herein discovered 24 novel polymorphic sites by 
characterizing the TNFRSF11B gene found in genomic DNAs isolated from Index Repository IA, which 
contains immortalized cell lines from one chimpanzee and 93 human individuals, and Index Repository IB, 
which contains 70 human individuals. These two repositories contain 51 individuals in common. 

The human individuals in Index Repository IA included a reference population of 79 unrelated 
individuals self-identified as belonging to one of four major population groups: Caucasian (22 
individuals), African descent (20 individuals), Asian (20 individuals) and Hispanic/Latino (17 individuals). 
To the extent possible, the members of this reference population were organized into population 
subgroups by the self-identified ethnogeographic origin of their four grandparents as shown in Table 1 
below. In addition, Index Repository IA contains three unrelated indigenous American Indians (one from 
each of North, Central and South America), one three-generation Caucasian family (from the CEPH Utah 
cohort) and one two-generation African-American family. 

Table 1. Population Groups in Index Repository IA 

Population Group    Population Subgroup                   No. of Individuals 
African descent                                           20 
                    Sierra Leone                          1 
Asian                                                     20 
                    Burma                                 1 
                    China                                 3 
                    Japan                                 6 
                    Korea                                 1 
                    Philippines                           5 
                    Vietnam                               4 
Caucasian                                                 22 
                    British Isles                         3 
                    British Isles/Central                 4 
                    British Isles/Eastern                 1 
                    Central/Eastern                       1 
                    Eastern                               3 
                    Central/Mediterranean                 1 
                    Mediterranean                         2 
                    Scandinavian                          2 
Hispanic/Latino                                           17 
                    Caribbean                             7 
                    Caribbean (Spanish Descent)           2 
                    Central American (Spanish Descent)    1 
                    Mexican American                      4 
                    South American (Spanish Descent)      3 

Index Repository IB contains a reference population of 70 human individuals comprised of 4 
three-generation families (from the CEPH Utah cohort) as well as unrelated African-American, Asian and 
Caucasian individuals. A total of 38 individuals in this reference population are unrelated. 

Using the TNFRSF11B genotypes identified in the Index Repositories and the methodology 
described in the Examples below, the inventors herein also determined the haplotypes found on each 
chromosome for most human members of these repositories. The TNFRSF11B genotypes and haplotypes 
found in the Index Repositories include those shown in Tables 4 and 5, respectively. The polymorphism 
and haplotype data disclosed herein are useful for studying population diversity, anthropological lineage, 
the significance of diversity and lineage at the phenotypic level, paternity testing, forensic applications, 
and for identifying associations between the TNFRSF11B genetic variation and a trait such as level of 
drug response or susceptibility to disease. 

In the context of this disclosure, the following terms shall be defined as follows unless otherwise 
indicated: 

Allele - A particular form of a genetic locus, distinguished from other forms by its particular 
nucleotide sequence. 

Candidate Gene - A gene which is hypothesized to be responsible for a disease, condition, or the 
response to a treatment, or to be correlated with one of these. 

Gene - A segment of DNA that contains all the information for the regulated biosynthesis of an 
RNA product, including promoters, exons, introns, and other untranslated regions that control expression. 

Genotype - An unphased 5' to 3' sequence of nucleotide pair(s) found at one or more 
polymorphic sites in a locus on a pair of homologous chromosomes in an individual. As used herein, 
genotype includes a full-genotype and/or a sub-genotype as described below. 

Full-genotype - The unphased 5' to 3' sequence of nucleotide pairs found at all known 
polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 

Sub-genotype - The unphased 5' to 3' sequence of nucleotides seen at a subset of the known 
polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 

Genotyping - A process for determining a genotype of an individual. 

Haplotype - A 5' to 3' sequence of nucleotides found at one or more polymorphic sites in a locus 
on a single chromosome from a single individual. As used herein, haplotype includes a full-haplotype 
and/or a sub-haplotype as described below. 

Full-haplotype - The 5' to 3' sequence of nucleotides found at all known polymorphic sites in a 
locus on a single chromosome from a single individual. 

Sub-haplotype - The 5' to 3' sequence of nucleotides seen at a subset of the known polymorphic 
sites in a locus on a single chromosome from a single individual. 

Haplotype pair - The two haplotypes found for a locus in a single individual. 

Haplotyping - A process for determining one or more haplotypes in an individual and includes 
use of family pedigrees, molecular techniques and/or statistical inference. 

Haplotype data - Information concerning one or more of the following for a specific gene: a 
listing of the haplotype pairs in each individual in a population; a listing of the different haplotypes in a 
population; frequency of each haplotype in that or other populations, and any known associations between 
one or more haplotypes and a trait. 

Isoform - A particular form of a gene, mRNA, cDNA or the protein encoded thereby, 
distinguished from other forms by its particular sequence and/or structure. 

Isogene - One of the isoforms of a gene found in a population. An isogene contains all of the 
polymorphisms present in the particular isoform of the gene. 

Isolated - As applied to a biological molecule such as RNA, DNA, oligonucleotide, or protein, 
isolated means the molecule is substantially free of other biological molecules such as nucleic acids, 
proteins, lipids, carbohydrates, or other material such as cellular debris and growth media. Generally, the 
term "isolated" is not intended to refer to a complete absence of such material or to absence of water, 
buffers, or salts, unless they are present in amounts that substantially interfere with the methods of the 
present invention. 

Locus - A location on a chromosome or DNA molecule corresponding to a gene or a physical or 
phenotypic feature. 

Naturally-occurring - A term used to designate that the object it is applied to, e.g., a 
naturally-occurring polynucleotide or polypeptide, can be isolated from a source in nature and has not been 
intentionally modified by man. 

Nucleotide pair - The nucleotides found at a polymorphic site on the two copies of a 
chromosome from an individual. 

Phased - As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a 
locus, phased means the combination of nucleotides present at those polymorphic sites on a single copy of 
the locus is known. 

Polymorphic site (PS) - A position within a locus at which at least two alternative sequences are 
found in a population, the most frequent of which has a frequency of no more than 99%. 

Polymorphic variant - A gene, mRNA, cDNA, polypeptide or peptide whose nucleotide or 
amino acid sequence varies from a reference sequence due to the presence of a polymorphism in the gene. 

Polymorphism - The sequence variation observed in an individual at a polymorphic site. 
Polymorphisms include nucleotide substitutions, insertions, deletions and microsatellites and may, but 
need not, result in detectable differences in gene expression or protein function. 

Polymorphism data - Information concerning one or more of the following for a specific gene: 
location of polymorphic sites; sequence variation at those sites; frequency of polymorphisms in one or 
more populations; the different genotypes and/or haplotypes determined for the gene; frequency of one or 
more of these genotypes and/or haplotypes in one or more populations; any known association(s) between 
a trait and a genotype or a haplotype for the gene. 

Polymorphism Database - A collection of polymorphism data arranged in a systematic or 
methodical way and capable of being individually accessed by electronic or other means. 

Polynucleotide - A nucleic acid molecule comprised of single-stranded RNA or DNA or 
comprised of complementary, double-stranded DNA. 

Population Group - A group of individuals sharing a common ethnogeographic origin. 

Reference Population - A group of subjects or individuals who are predicted to be representative 
of the genetic variation found in the general population. Typically, the reference population represents 
the genetic variation in the population at a certainty level of at least 85%, preferably at least 90%, more 
preferably at least 95% and even more preferably at least 99%. 

Single Nucleotide Polymorphism (SNP) - Typically, the specific pair of nucleotides observed at 
a single polymorphic site. In rare cases, three or four nucleotides may be found. 

Subject - A human individual whose genotypes or haplotypes or response to treatment or disease 
state are to be determined. 

Treatment - A stimulus administered internally or externally to a subject. 

Unphased - As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a 
locus, unphased means the combination of nucleotides present at those polymorphic sites on a single copy 
of the locus is not known. 
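
The relationship between several of the terms defined above can be illustrated with a short sketch. 
It shows how a haplotype pair (phased) collapses to a genotype (unphased), so that two different 
haplotype pairs can yield the same genotype, and how the 99% threshold in the definition of a 
polymorphic site can be applied to a sample of observed alleles. The function names and all sample 
data are hypothetical, and applying the threshold to a sample rather than to a whole population is a 
simplifying assumption.

```python
from collections import Counter
from typing import List, Sequence


def unphased_genotype(hap1: str, hap2: str) -> List[str]:
    """Collapse a haplotype pair into an unphased genotype.

    Each site is reported as a nucleotide pair in alphabetical order, which
    discards the information about which chromosome carried which nucleotide.
    """
    if len(hap1) != len(hap2):
        raise ValueError("haplotypes must cover the same polymorphic sites")
    return ["".join(sorted(pair)) for pair in zip(hap1, hap2)]


def is_polymorphic_site(alleles: Sequence[str]) -> bool:
    """Apply the definition: at least two alternative sequences are observed and
    the most frequent has a frequency of no more than 99%."""
    counts = Counter(alleles)
    if len(counts) < 2:
        return False
    # Integer arithmetic avoids floating-point rounding at the 99% boundary.
    return max(counts.values()) * 100 <= 99 * len(alleles)


if __name__ == "__main__":
    # Two different (hypothetical) haplotype pairs give the same genotype.
    print(unphased_genotype("GCT", "ACC"))
    print(unphased_genotype("ACT", "GCC"))
    # 1 variant allele among 200 chromosomes is below the threshold.
    print(is_polymorphic_site("G" * 199 + "A"))
    # 1 variant allele among 100 chromosomes meets it exactly.
    print(is_polymorphic_site("G" * 99 + "A"))
```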

The inventors herein have discovered 24 novel polymorphic sites, and confirmed the existence of 
a 25th site, in the TNFRSF11B gene. The polymorphic sites identified by the inventors are referred to as 
PS1-25 to designate the order in which they are located in the gene (see Table 3 below), with the novel 
polymorphic sites referred to as PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, 
PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS25, and the previously 
reported polymorphic site referred to as PS6. 

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a
polymorphic variant of the TNFRSF11B gene or a fragment of the gene which contains at least one of the
novel polymorphic sites described herein. The nucleotide sequence of a variant TNFRSF11B gene is
identical to the reference genomic sequence for those portions of the gene examined, as described in the
Examples below, except that it comprises a different nucleotide at one or more of the novel polymorphic
sites selected from the group consisting of PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12,
PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS25, and may also
comprise an additional polymorphism of cytosine at PS6. Similarly, the nucleotide sequence of a variant
fragment of the TNFRSF11B gene is identical to the corresponding portion of the reference sequence
except for having a different nucleotide at one or more of the novel polymorphic sites described herein.
Thus, the invention specifically does not include polynucleotides comprising a nucleotide sequence
identical to the reference sequence (or other reported TNFRSF11B sequences) or to portions of the
reference sequence (or other reported TNFRSF11B sequences), except for genotyping oligonucleotides as
described below.

The location of a polymorphism in a variant gene or fragment is identified by aligning its
sequence against SEQ ID NOS:1-2. The polymorphism is selected from the group consisting of adenine
at PS1, thymine at PS2, thymine at PS3, thymine at PS4, cytosine at PS5, thymine at PS7, adenine at PS8,
adenine at PS9, thymine at PS10, cytosine at PS11, cytosine at PS12, thymine at PS13, adenine at PS14,
thymine at PS15, adenine at PS16, guanine at PS17, guanine at PS18, cytosine at PS19, adenine at PS20,
cytosine at PS21, thymine at PS22, adenine at PS23, thymine at PS24 and cytosine at PS25. In a
preferred embodiment, the polymorphic variant comprises a naturally-occurring isogene of the
TNFRSF11B gene which is defined by any one of haplotypes 1-27 shown in Table 5 below.
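
As an illustration of how the group of variant nucleotides listed above might be used computationally, the following sketch records the variant allele at each novel site and reports which of them are present in a set of observed nucleotides. The allele table is transcribed from the paragraph above; the function names and the example observations are hypothetical, and the sketch assumes the observed nucleotides have already been assigned to sites by alignment against the reference sequence.

```python
# Variant nucleotides at the 24 novel polymorphic sites, as listed in the text.
NOVEL_VARIANT_ALLELES = {
    "PS1": "A", "PS2": "T", "PS3": "T", "PS4": "T", "PS5": "C",
    "PS7": "T", "PS8": "A", "PS9": "A", "PS10": "T", "PS11": "C",
    "PS12": "C", "PS13": "T", "PS14": "A", "PS15": "T", "PS16": "A",
    "PS17": "G", "PS18": "G", "PS19": "C", "PS20": "A", "PS21": "C",
    "PS22": "T", "PS23": "A", "PS24": "T", "PS25": "C",
}
# PS6 (cytosine) was previously reported, so it is tracked separately.
KNOWN_VARIANT_ALLELES = {"PS6": "C"}

def novel_variant_sites(observed):
    """List the novel sites at which the observed nucleotide is the variant allele.

    observed: dict mapping a site name (e.g. "PS1") to the nucleotide found
    on one copy of the gene. Sites absent from the dict are ignored.
    """
    return [site for site, allele in NOVEL_VARIANT_ALLELES.items()
            if observed.get(site, "").upper() == allele]

def is_novel_variant(observed):
    """True if at least one novel variant nucleotide is present."""
    return bool(novel_variant_sites(observed))

# Hypothetical observations for one copy of the gene.
example = {"PS1": "A", "PS2": "C", "PS6": "C"}
print(novel_variant_sites(example), is_novel_variant(example))
```

A sequence carrying only cytosine at PS6 is not flagged, consistent with the text's statement that PS6 is an additional, previously reported polymorphism rather than one of the novel sites.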

Polymorphic variants of the invention may be prepared by isolating a clone containing the
TNFRSF11B gene from a human genomic library. The clone may be sequenced to determine the identity
of the nucleotides at the polymorphic sites described herein. Any particular variant claimed herein could
be prepared from this clone by performing in vitro mutagenesis using procedures well-known in the art.

TNFRSF11B isogenes may be isolated using any method that allows separation of the two
"copies" of the TNFRSF11B gene present in an individual, which, as readily understood by the skilled
artisan, may be the same allele or different alleles. Separation methods include targeted in vivo cloning
(TIVC) in yeast as described in WO 98/01573, U.S. Patent No. 5,866,404, and copending U.S.
application Serial No. 08/987,966. Another method, which is described in copending U.S. Application
Serial No. 08/987,966, uses an allele specific oligonucleotide in combination with primer extension and
exonuclease degradation to generate hemizygous DNA targets. Yet other methods are single molecule
dilution (SMD) as described in Ruaño et al., Proc. Natl. Acad. Sci. 87:6296-6300, 1990; and allele
specific PCR (Ruaño et al., 17 Nucleic Acids Res. 8392, 1989; Ruaño et al., 19 Nucleic Acids Res. 6877-
6882, 1991; Michalatos-Beloin et al., 24 Nucleic Acids Res. 4841-4843, 1996).

The invention also provides TNFRSF11B genome anthologies, which are collections of
TNFRSF11B isogenes found in a given population. The population may be any group of at least two
individuals, including but not limited to a reference population, a population group, a family population, a
clinical population, and a same sex population. A TNFRSF11B genome anthology may comprise
individual TNFRSF11B isogenes stored in separate containers such as microtest tubes, separate wells of a
microtitre plate and the like. Alternatively, two or more groups of the TNFRSF11B isogenes in the
anthology may be stored in separate containers. Individual isogenes or groups of isogenes in a genome
anthology may be stored in any convenient and stable form, including but not limited to in buffered
solutions, as DNA precipitates, freeze-dried preparations and the like. A preferred TNFRSF11B genome
anthology of the invention comprises a set of isogenes defined by the haplotypes shown in Table 5 below.

An isolated polynucleotide containing a polymorphic variant nucleotide sequence of the invention
may be operably linked to one or more expression regulatory elements in a recombinant expression vector
capable of being propagated and expressing the encoded TNFRSF11B protein in a prokaryotic or a
eukaryotic host cell. Examples of expression regulatory elements which may be used include, but are not
limited to, the lac system, operator and promoter regions of phage lambda, yeast promoters, and
promoters derived from vaccinia virus, adenovirus, retroviruses, or SV40. Other regulatory elements
include, but are not limited to, appropriate leader sequences, termination codons, polyadenylation signals,
and other sequences required for the appropriate transcription and subsequent translation of the nucleic
acid sequence in a given host cell. Of course, the correct combinations of expression regulatory elements
will depend on the host system used. In addition, it is understood that the expression vector contains any
additional elements necessary for its transfer to and subsequent replication in the host cell. Examples of
such elements include, but are not limited to, origins of replication and selectable markers. Such
expression vectors are commercially available or are readily constructed using methods known to those in
the art (e.g., F. Ausubel et al., 1987, in "Current Protocols in Molecular Biology", John Wiley and Sons,
New York, New York). Host cells which may be used to express the variant TNFRSF11B sequences of
the invention include, but are not limited to, eukaryotic and mammalian cells, such as animal, plant, insect
and yeast cells, and prokaryotic cells, such as E. coli, or algal cells as known in the art. The recombinant
expression vector may be introduced into the host cell using any method known to those in the art
including, but not limited to, microinjection, electroporation, particle bombardment, transduction, and
transfection using DEAE-dextran, lipofection, or calcium phosphate (see e.g., Sambrook et al. (1989) in
"Molecular Cloning. A Laboratory Manual", Cold Spring Harbor Press, Plainview, New York). In a
preferred aspect, eukaryotic expression vectors that function in eukaryotic cells, and preferably
mammalian cells, are used. Non-limiting examples of such vectors include vaccinia virus vectors,
adenovirus vectors, herpes virus vectors, and baculovirus transfer vectors. Preferred eukaryotic cell lines
include COS cells, CHO cells, HeLa cells, NIH/3T3 cells, and embryonic stem cells (Thomson, J. A. et
al., 1998 Science 282:1145-1147). Particularly preferred host cells are mammalian cells.

As will be readily recognized by the skilled artisan, expression of polymorphic variants of the
TNFRSF11B gene will produce TNFRSF11B mRNAs varying from each other at any polymorphic site
retained in the spliced and processed mRNA molecules. These mRNAs can be used for the preparation of
a TNFRSF11B cDNA comprising a nucleotide sequence which is a polymorphic variant of the
TNFRSF11B reference coding sequence shown in Figure 3. Thus, the invention also provides
TNFRSF11B mRNAs and corresponding cDNAs which comprise a nucleotide sequence that is identical
to SEQ ID NO:3 (Fig. 3), or its corresponding RNA sequence, except for having one or more
polymorphisms selected from the group consisting of thymine at a position corresponding to nucleotide
699, adenine at a position corresponding to nucleotide 714, guanine at a position corresponding to
nucleotide 720, guanine at a position corresponding to nucleotide 768, adenine at a position
corresponding to nucleotide 841, thymine at a position corresponding to nucleotide 1102 and cytosine at a
position corresponding to nucleotide 1150, and may also comprise an additional polymorphism of
cytosine at a position corresponding to nucleotide 9 in Figure 3. Fragments of these variant mRNAs and
cDNAs are included in the scope of the invention, provided they contain the novel polymorphisms
described herein. The invention specifically excludes polynucleotides identical to previously identified
and characterized TNFRSF11B cDNAs and fragments thereof. Polynucleotides comprising a variant
RNA or DNA sequence may be isolated from a biological sample using well-known molecular biological
procedures or may be chemically synthesized.

Genomic and cDNA fragments of the invention comprise at least one novel polymorphic site
identified herein and have a length of at least 10 nucleotides and may range up to the full length of the
gene. Preferably, a fragment according to the present invention is between 100 and 3000 nucleotides in
length, and more preferably between 200 and 2000 nucleotides in length, and most preferably between
500 and 1000 nucleotides in length.

In describing the polymorphic sites identified herein, reference is made to the sense strand of the
gene for convenience. However, as recognized by the skilled artisan, nucleic acid molecules containing
the TNFRSF11B gene may be complementary double stranded molecules and thus reference to a
particular site on the sense strand refers as well to the corresponding site on the complementary antisense
strand. Thus, reference may be made to the same polymorphic site on either strand and an
oligonucleotide may be designed to hybridize specifically to either strand at a target region containing the
polymorphic site. Thus, the invention also includes single-stranded polynucleotides which are
complementary to the sense strand of the TNFRSF11B genomic variants described herein.
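
The correspondence between a site on the sense strand and the same site on the antisense strand can be sketched as follows. The sequence used is a short hypothetical example, not a portion of SEQ ID NOS:1-2, and both function names are illustrative; positions are 1-based and each strand is read 5' to 3'.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq):
    """Return the antisense strand, written 5' to 3', for a sense-strand sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))

def antisense_position(sense_pos, length):
    """Map a 1-based position on the sense strand to the 1-based position of
    the paired nucleotide on the antisense strand (both read 5' to 3')."""
    return length - sense_pos + 1

# Hypothetical 5-nucleotide target region with the polymorphic site at position 2.
sense = "ATGCA"
antisense = reverse_complement(sense)
site = antisense_position(2, len(sense))
print(antisense, site, antisense[site - 1])
```

An oligonucleotide designed against the antisense strand would therefore interrogate the complementary nucleotide at the mapped position.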

Polynucleotides comprising a polymorphic gene variant or fragment may be useful for
therapeutic purposes. For example, where a patient could benefit from expression, or increased
expression, of a particular TNFRSF11B protein isoform, an expression vector encoding the isoform may
be administered to the patient. The patient may be one who lacks the TNFRSF11B isogene encoding that
isoform or may already have at least one copy of that isogene.

In other situations, it may be desirable to decrease or block expression of a particular
TNFRSF11B isogene. Expression of a TNFRSF11B isogene may be turned off by transforming a
targeted organ, tissue or cell population with an expression vector that expresses high levels of
untranslatable mRNA for the isogene. Alternatively, oligonucleotides directed against the regulatory
regions (e.g., promoter, introns, enhancers, 3' untranslated region) of the isogene may block transcription.
Oligonucleotides targeting the transcription initiation site, e.g., between positions -10 and +10 from the
start site, are preferred. Similarly, inhibition of transcription can be achieved using oligonucleotides that
base-pair with region(s) of the isogene DNA to form triplex DNA (see e.g., Gee et al. in Huber, B.E. and
B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, N.Y., 1994).
Antisense oligonucleotides may also be designed to block translation of TNFRSF11B mRNA transcribed
from a particular isogene. It is also contemplated that ribozymes may be designed that can catalyze the
specific cleavage of TNFRSF11B mRNA transcribed from a particular isogene.

The oligonucleotides may be delivered to a target cell or tissue by expression from a vector
introduced into the cell or tissue in vivo or ex vivo. Alternatively, the oligonucleotides may be formulated
as a pharmaceutical composition for administration to the patient. Oligoribonucleotides and/or
oligodeoxynucleotides intended for use as antisense oligonucleotides may be modified to increase
stability and half-life. Possible modifications include, but are not limited to, phosphorothioate or
2'-O-methyl linkages, and the inclusion of nontraditional bases such as inosine and queuosine, as well as acetyl-,
methyl-, thio-, and similarly modified forms of adenine, cytosine, guanine, thymine, and uracil which are
not as easily recognized by endogenous nucleases.

The invention also provides an isolated polypeptide comprising a polymorphic variant of the
reference TNFRSF11B amino acid sequence shown in Figure 4 (SEQ ID NO:4). The location of a variant
amino acid in a TNFRSF11B polypeptide or fragment of the invention is identified by aligning its
sequence against Fig. 4. A TNFRSF11B protein variant of the invention comprises an amino acid
sequence identical to SEQ ID NO:4 except for having one or more variant amino acids selected from the
group consisting of methionine at a position corresponding to amino acid position 240, methionine at a
position corresponding to amino acid position 281, and serine at a position corresponding to amino acid
368, and may also comprise an additional variant amino acid of asparagine at a position corresponding to
amino acid 3 in Figure 4. The invention specifically excludes amino acid sequences identical to those
previously identified for TNFRSF11B, including SEQ ID NO:4, and previously described fragments
thereof. TNFRSF11B protein variants included within the invention comprise all amino acid sequences
based on SEQ ID NO:4 and having the combination of amino acid variations described in Table 2 below.
In preferred embodiments, a TNFRSF11B protein variant of the invention is encoded by an isogene
defined by one of the observed haplotypes shown in Table 5.

Table 2. Novel Protein Variants of TNFRSF11B

The invention also includes TNFRSF11B peptide variants, which are any fragments of a
TNFRSF11B protein variant that contains one or more of the novel amino acid variations shown in Table
2. A TNFRSF11B peptide variant is at least 6 amino acids in length and is preferably any number
between 6 and 30 amino acids long, more preferably between 10 and 25, and most preferably between 15
and 20 amino acids long. Such TNFRSF11B peptide variants may be useful as antigens to generate
antibodies specific for one of the above TNFRSF11B isoforms. In addition, the TNFRSF11B peptide
variants may be useful in drug screening assays.
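
The requirement that a peptide variant contain a variant amino acid can be illustrated by enumerating the fragments of a given length that span a chosen position. This is a minimal sketch under stated assumptions: the function name is hypothetical, positions are 1-based and inclusive, and the protein length and variant position in the example are arbitrary small values rather than values taken from Figure 4.

```python
def peptide_windows(protein_length, variant_pos, peptide_length):
    """Return (start, end) coordinates of every fragment of the given length
    that lies within the protein and includes the variant position."""
    if not 1 <= variant_pos <= protein_length:
        raise ValueError("variant position lies outside the protein")
    # The fragment must start early enough to cover the variant position
    # and late enough that it does not run past the end of the protein.
    first_start = max(1, variant_pos - peptide_length + 1)
    last_start = min(variant_pos, protein_length - peptide_length + 1)
    return [(start, start + peptide_length - 1)
            for start in range(first_start, last_start + 1)]

# Hypothetical 10-residue protein, variant at residue 3, minimum length of 6.
print(peptide_windows(10, 3, 6))
```

Repeating the call for each length in a preferred range (for example 15 through 20) would give the full set of candidate fragments for one variant position.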

A TNFRSF11B variant protein or peptide of the invention may be prepared by chemical synthesis
or by expressing one of the variant TNFRSF11B genomic and cDNA sequences as described above.
Alternatively, the TNFRSF11B protein variant may be isolated from a biological sample of an individual
having a TNFRSF11B isogene which encodes the variant protein. Where the sample contains two
different TNFRSF11B isoforms (i.e., the individual has different TNFRSF11B isogenes), a particular
TNFRSF11B isoform of the invention can be isolated by immunoaffinity chromatography using an
antibody which specifically binds to that particular TNFRSF11B isoform but does not bind to the other
TNFRSF11B isoform.

The expressed or isolated TNFRSF11B protein may be detected by methods known in the art,
including Coomassie blue staining, silver staining, and Western blot analysis using antibodies specific for
the isoform of the TNFRSF11B protein as discussed further below. TNFRSF11B variant proteins can be
purified by standard protein purification procedures known in the art, including differential precipitation,
molecular sieve chromatography, ion-exchange chromatography, isoelectric focusing, gel electrophoresis,
affinity and immunoaffinity chromatography and the like (Ausubel et al., 1987, In Current Protocols in
Molecular Biology, John Wiley and Sons, New York, New York). In the case of immunoaffinity
chromatography, antibodies specific for a particular polymorphic variant may be used.

A polymorphic variant TNFRSF11B gene of the invention may also be fused in frame with a
heterologous sequence to encode a chimeric TNFRSF11B protein. The non-TNFRSF11B portion of the
chimeric protein may be recognized by a commercially available antibody. In addition, the chimeric
protein may also be engineered to contain a cleavage site located between the TNFRSF11B and
non-TNFRSF11B portions so that the TNFRSF11B protein may be cleaved and purified away from the
non-TNFRSF11B portion.

An additional embodiment of the invention relates to using a novel TNFRSF11B protein isoform
in any of a variety of drug screening assays. Such screening assays may be performed to identify agents
that bind specifically to all known TNFRSF11B protein isoforms or to only a subset of one or more of
these isoforms. The agents may be from chemical compound libraries, peptide libraries and the like. The
TNFRSF11B protein or peptide variant may be free in solution or affixed to a solid support. In one
embodiment, high throughput screening of compounds for binding to a TNFRSF11B variant may be
accomplished using the method described in PCT application WO84/03565, in which large numbers of
test compounds are synthesized on a solid substrate, such as plastic pins or some other surface, contacted
with the TNFRSF11B protein(s) of interest and then washed. Bound TNFRSF11B protein(s) are then
detected using methods well-known in the art.

In another embodiment, a novel TNFRSF11B protein isoform may be used in assays to measure
the binding affinities of one or more candidate drugs targeting the TNFRSF11B protein.

In another embodiment, the invention provides antibodies specific for and immunoreactive with
one or more of the novel TNFRSF11B variant proteins described herein. The antibodies may be either
monoclonal or polyclonal in origin. The TNFRSF11B protein or peptide variant used to generate the
antibodies may be from natural or recombinant sources or produced by chemical synthesis using synthesis
techniques known in the art. If the TNFRSF11B protein variant is of insufficient size to be antigenic, it
may be conjugated, complexed, or otherwise covalently linked to a carrier molecule to enhance the
antigenicity of the peptide. Examples of carrier molecules include, but are not limited to, albumins (e.g.,
human, bovine, fish, ovine), and keyhole limpet hemocyanin (Basic and Clinical Immunology, 1991, Eds.
D.P. Stites, and A.I. Terr, Appleton and Lange, Norwalk, Connecticut, San Mateo, California).

In one embodiment, an antibody specifically immunoreactive with one of the novel TNFRSF11B
protein isoforms described herein is administered to an individual to neutralize activity of the
TNFRSF11B isoform expressed by that individual. The antibody may be formulated as a pharmaceutical
composition which includes a pharmaceutically acceptable carrier.

Antibodies specific for and immunoreactive with one of the novel TNFRSF11B protein isoforms
described herein may be used to immunoprecipitate the TNFRSF11B protein variant from solution as
well as react with TNFRSF11B protein isoforms on Western or immunoblots of polyacrylamide gels on
membrane supports or substrates. In another preferred embodiment, the antibodies will detect
TNFRSF11B protein isoforms in paraffin or frozen tissue sections, or in cells which have been fixed or
unfixed and prepared on slides, coverslips, or the like, for use in immunocytochemical,
immunohistochemical, and immunofluorescence techniques.

In another embodiment, an antibody specifically immunoreactive with one of the novel
TNFRSF11B protein variants described herein is used in immunoassays to detect this variant in biological
samples. In this method, an antibody of the present invention is contacted with a biological sample and
the formation of a complex between the TNFRSF11B protein variant and the antibody is detected. As
described, suitable immunoassays include radioimmunoassay, Western blot assay, immunofluorescent
assay, enzyme linked immunoassay (ELISA), chemiluminescent assay, immunohistochemical assay,
immunocytochemical assay, and the like (see, e.g., Principles and Practice of Immunoassay, 1991, Eds.
Christopher P. Price and David J. Newman, Stockton Press, New York, New York; Current Protocols in
Molecular Biology, 1987, Eds. Ausubel et al., John Wiley and Sons, New York, New York). Standard
techniques known in the art for ELISA are described in Methods in Immunodiagnosis, 2nd Ed., Eds. Rose
and Bigazzi, John Wiley and Sons, New York 1980; and Campbell et al., 1984, Methods in Immunology,
W.A. Benjamin, Inc. Such assays may be direct, indirect, competitive, or noncompetitive as described
in the art (see, e.g., Principles and Practice of Immunoassay, 1991, Eds. Christopher P. Price and David J.
Newman, Stockton Press, NY, NY; and Oellerich, M., 1984, J. Clin. Chem. Clin. Biochem., 22:895-904).
Proteins may be isolated from test specimens and biological samples by conventional methods, as
described in Current Protocols in Molecular Biology, supra.

Exemplary antibody molecules for use in the detection and therapy methods of the present
invention are intact immunoglobulin molecules, substantially intact immunoglobulin molecules, or those
portions of immunoglobulin molecules that contain the antigen binding site. Polyclonal or monoclonal
antibodies may be produced by methods conventionally known in the art (e.g., Kohler and Milstein, 1975,
Nature, 256:495-497; Campbell, Monoclonal Antibody Technology, the Production and Characterization
of Rodent and Human Hybridomas, 1985, In: Laboratory Techniques in Biochemistry and Molecular
Biology, Eds. Burdon et al., Volume 13, Elsevier Science Publishers, Amsterdam). The antibodies or
antigen binding fragments thereof may also be produced by genetic engineering. The technology for
expression of both heavy and light chain genes in E. coli is the subject of PCT patent applications,
publication number WO 901443, WO 901443 and WO 9014424 and in Huse et al., 1989, Science,
246:1275-1281. The antibodies may also be humanized (e.g., Queen, C. et al., 1989, Proc. Natl. Acad. Sci.
86:10029).

Effect(s) of the polymorphisms identified herein on expression of TNFRSF11B may be
investigated by preparing recombinant cells and/or organisms, preferably recombinant animals,
containing a polymorphic variant of the TNFRSF11B gene. As used herein, "expression" includes but is
not limited to one or more of the following: transcription of the gene into precursor mRNA; splicing and
other processing of the precursor mRNA to produce mature mRNA; mRNA stability; translation of the
mature mRNA into TNFRSF11B protein (including codon usage and tRNA availability); and
glycosylation and/or other modifications of the translation product, if required for proper expression and
function.

To prepare a recombinant cell of the invention, the desired TNFRSF11B isogene may be
introduced into the cell in a vector such that the isogene remains extrachromosomal. In such a situation,
the gene will be expressed by the cell from the extrachromosomal location. In a preferred embodiment,
the TNFRSF11B isogene is introduced into a cell in such a way that it recombines with the endogenous
TNFRSF11B gene present in the cell. Such recombination requires the occurrence of a double
recombination event, thereby resulting in the desired TNFRSF11B gene polymorphism. Vectors for the
introduction of genes both for recombination and for extrachromosomal maintenance are known in the
art, and any suitable vector or vector construct may be used in the invention. Methods such as
electroporation, particle bombardment, calcium phosphate co-precipitation and viral transduction for
introducing DNA into cells are known in the art; therefore, the choice of method may lie with the
competence and preference of the skilled practitioner. Examples of cells into which the TNFRSF11B
isogene may be introduced include, but are not limited to, continuous culture cells, such as COS,
NIH/3T3, and primary or culture cells of the relevant tissue type, i.e., they express the TNFRSF11B
isogene. Such recombinant cells can be used to compare the biological activities of the different protein
variants.

Recombinant organisms, i.e., transgenic animals, expressing a variant TNFRSF11B gene are
prepared using standard procedures known in the art. Preferably, a construct comprising the variant gene
is introduced into a nonhuman animal or an ancestor of the animal at an embryonic stage, i.e., the one-cell
stage, or generally not later than about the eight-cell stage. Transgenic animals carrying the constructs of
the invention can be made by several methods known to those having skill in the art. One method
involves transfecting into the embryo a retrovirus constructed to contain one or more insulator elements, a
gene or genes of interest, and other components known to those skilled in the art to provide a complete
shuttle vector harboring the insulated gene(s) as a transgene, see e.g., U.S. Patent No. 5,610,053. Another
method involves directly injecting a transgene into the embryo. A third method involves the use of
embryonic stem cells. Examples of animals into which the TNFRSF11B isogenes may be introduced
include, but are not limited to, mice, rats, other rodents, and nonhuman primates (see "The Introduction of
Foreign Genes into Mice" and the cited references therein, In: Recombinant DNA, Eds. J.D. Watson, M.
Gilman, J. Witkowski, and M. Zoller; W.H. Freeman and Company, New York, pages 254-272).
Transgenic animals stably expressing a human TNFRSF11B isogene and producing human TNFRSF11B
protein can be used as biological models for studying diseases related to abnormal TNFRSF11B
expression and/or activity, and for screening and assaying various candidate drugs, compounds, and
treatment regimens to reduce the symptoms or effects of these diseases.

An additional embodiment of the invention relates to pharmaceutical compositions for treating
disorders affected by expression or function of a novel TNFRSF11B isogene described herein. The
pharmaceutical composition may comprise any of the following active ingredients: a polynucleotide
comprising one of these novel TNFRSF11B isogenes; an antisense oligonucleotide directed against one of
the novel TNFRSF11B isogenes, a polynucleotide encoding such an antisense oligonucleotide, or another
compound which inhibits expression of a novel TNFRSF11B isogene described herein. Preferably, the
composition contains the active ingredient in a therapeutically effective amount. By therapeutically
effective amount is meant that one or more of the symptoms relating to disorders affected by expression
or function of a novel TNFRSF11B isogene is reduced and/or eliminated. The composition also
comprises a pharmaceutically acceptable carrier, examples of which include, but are not limited to, saline,
buffered saline, dextrose, and water. Those skilled in the art may employ a formulation most suitable for
the active ingredient, whether it is a polynucleotide, oligonucleotide, protein, peptide or small molecule
antagonist. The pharmaceutical composition may be administered alone or in combination with at least
one other agent, such as a stabilizing compound. Administration of the pharmaceutical composition may
be by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial,
intramedullary, intrathecal, intraventricular, intradermal, transdermal, subcutaneous, intraperitoneal,
intranasal, enteral, topical, sublingual, or rectal. Further details on techniques for formulation and
administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Mack
Publishing Co., Easton, PA).

For any composition, determination of the therapeutically effective dose of active ingredient
and/or the appropriate route of administration is well within the capability of those skilled in the art. For
example, the dose can be estimated initially either in cell culture assays or in animal models. The animal
model may also be used to determine the appropriate concentration range and route of administration.
Such information can then be used to determine useful doses and routes for administration in humans.
The exact dosage will be determined by the practitioner, in light of factors relating to the patient requiring
treatment, including but not limited to severity of the disease state, general health, age, weight and gender
of the patient, diet, time and frequency of administration, other drugs being taken by the patient, and
tolerance/response to the treatment.

Information on the identity of genotypes and haplotypes for the TNFRSF11B gene of any
particular individual as well as the frequency of such genotypes and haplotypes in any particular
population of individuals is expected to be useful for a variety of basic research and clinical applications.
Thus, the invention also provides compositions and methods for detecting the novel TNFRSF11B
polymorphisms identified herein.

The compositions comprise at least one TNFRSF11B genotyping oligonucleotide. In one
embodiment, a TNFRSF11B genotyping oligonucleotide is a probe or primer capable of hybridizing to a
target region that is located close to, or that contains, one of the novel polymorphic sites described herein.
As used herein, the term "oligonucleotide" refers to a polynucleotide molecule having less than about 100
nucleotides. A preferred oligonucleotide of the invention is 10 to 35 nucleotides long. More preferably,
the oligonucleotide is between 15 and 30, and most preferably, between 20 and 25 nucleotides in length.
The oligonucleotide may be comprised of any phosphorylation state of ribonucleotides,
deoxyribonucleotides, and acyclic nucleotide derivatives, and other functionally equivalent derivatives.
Alternatively, oligonucleotides may have a phosphate-free backbone, which may be comprised of
linkages such as carboxymethyl, acetamidate, carbamate, polyamide (peptide nucleic acid (PNA)) and the
like (Varma, R. in Molecular Biology and Biotechnology, A Comprehensive Desk Reference, Ed. R.
Meyers, VCH Publishers, Inc. (1995), pages 617-620). Oligonucleotides of the invention may be
prepared by chemical synthesis using any suitable methodology known in the art, or may be derived from
a biological sample, for example, by restriction digestion. The oligonucleotides may be labeled,
according to any technique known in the art, including use of radiolabels, fluorescent labels, enzymatic
labels, proteins, haptens, antibodies, sequence tags and the like.
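
The nested length preferences stated above for genotyping oligonucleotides can be expressed as a small classifier. This is only a sketch: the function name and tier labels are hypothetical, the ranges are treated as inclusive, and "less than about 100 nucleotides" is approximated here as strictly fewer than 100, which is an assumption rather than a limit stated in the text.

```python
def oligo_length_tier(length):
    """Classify an oligonucleotide length against the stated preferences."""
    if length < 1:
        raise ValueError("length must be a positive number of nucleotides")
    if length >= 100:          # assumed cutoff for "less than about 100"
        return "not an oligonucleotide"
    if 20 <= length <= 25:
        return "most preferred"
    if 15 <= length <= 30:
        return "more preferred"
    if 10 <= length <= 35:
        return "preferred"
    return "oligonucleotide"

# Hypothetical candidate probe lengths.
for n in (8, 12, 17, 22, 40, 150):
    print(n, oligo_length_tier(n))
```

The tiers are checked from narrowest to widest so that a length falling in several nested ranges is reported under the most specific one.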

Genotyping oligonucleotides of the invention must be capable of specifically hybridizing to a
target region of a TNFRSF11B polynucleotide, i.e., a TNFRSF11B isogene. As used herein, specific
hybridization means the oligonucleotide forms an anti-parallel double-stranded structure with the target
region under certain hybridizing conditions, while failing to form such a structure when incubated with a
non-target region or a non-TNFRSF11B polynucleotide under the same hybridizing conditions.
Preferably, the oligonucleotide specifically hybridizes to the target region under conventional high
stringency conditions. The skilled artisan can readily design and test oligonucleotide probes and primers
suitable for detecting polymorphisms in the TNFRSF11B gene using the polymorphism information
provided herein in conjunction with the known sequence information for the TNFRSF11B gene and
routine techniques.

A nucleic acid molecule such as an oligonucleotide or polynucleotide is said to be a "perfect" or
"complete" complement of another nucleic acid molecule if every nucleotide of one of the molecules is
complementary to the nucleotide at the corresponding position of the other molecule. A nucleic acid
molecule is "substantially complementary" to another molecule if it hybridizes to that molecule with
sufficient stability to remain in a duplex form under conventional low-stringency conditions.
Conventional hybridization conditions are described, for example, by Sambrook J. et al., in Molecular
Cloning, A Laboratory Manual, 2nd Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989)
and by Haymes, B.D. et al. in Nucleic Acid Hybridization, A Practical Approach, IRL Press, Washington,
D.C. (1985). While perfectly complementary oligonucleotides are preferred for detecting
polymorphisms, departures from complete complementarity are contemplated where such departures do
not prevent the molecule from specifically hybridizing to the target region. For example, an
oligonucleotide primer may have a non-complementary fragment at its 5' end, with the remainder of the
primer being complementary to the target region. Alternatively, non-complementary nucleotides may be
interspersed into the oligonucleotide probe or primer as long as the resulting probe or primer is still
capable of specifically hybridizing to the target region.
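
By way of illustration only, the notions of a "perfect" complement and of a departure from complete complementarity can be sketched as follows. The helper names are hypothetical and form no part of the specification; the sketch assumes unmodified DNA written 5' to 3' and counts mismatches only, without modeling hybridization stability or stringency.

```python
# Illustrative helpers; names are hypothetical and not taken from the specification.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def mismatch_count(oligo: str, target: str) -> int:
    """Count non-complementary positions when two equal-length strands
    are aligned anti-parallel."""
    if len(oligo) != len(target):
        raise ValueError("sequences must have equal length")
    paired = reverse_complement(target)
    return sum(a != b for a, b in zip(oligo.upper(), paired))


def is_perfect_complement(oligo: str, target: str) -> bool:
    """True if every nucleotide of the oligo pairs with the target."""
    return len(oligo) == len(target) and mismatch_count(oligo, target) == 0
```

For example, the reverse complement of SEQ ID NO: 5 is a perfect complement of SEQ ID NO: 5 and carries a single mismatch against SEQ ID NO: 6.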

Preferred genotyping oligonucleotides of the invention are allele-specific oligonucleotides. As
used herein, the term allele-specific oligonucleotide (ASO) means an oligonucleotide that is able, under
sufficiently stringent conditions, to hybridize specifically to one allele of a gene, or other locus, at a target
region containing a polymorphic site while not hybridizing to the corresponding region in another
allele(s). As understood by the skilled artisan, allele-specificity will depend upon a variety of readily
optimized stringency conditions, including salt and formamide concentrations, as well as temperatures for
both the hybridization and washing steps. Examples of hybridization and washing conditions typically
used for ASO probes are found in Kogan et al., "Genetic Prediction of Hemophilia A" in PCR Protocols,
A Guide to Methods and Applications, Academic Press, 1990 and Ruano et al., 87 Proc. Natl. Acad. Sci.
USA 6296-6300, 1990. Typically, an allele-specific oligonucleotide will be perfectly complementary to
one allele while containing a single mismatch for another allele.

Allele-specific oligonucleotide probes which usually provide good discrimination between
different alleles are those in which a central position of the oligonucleotide probe aligns with the
polymorphic site in the target region (e.g., approximately the 7th or 8th position in a 15mer, the 8th or 9th
position in a 16mer, the 10th or 11th position in a 20mer). A preferred ASO probe for detecting
TNFRSF11B gene polymorphisms comprises a nucleotide sequence, listed 5' to 3', selected from the
group consisting of:



Accession No.: AB008821

ACTTGAAGATGAATG (SEQ ID NO: 5) and its complement,
ACTTGAAAATGAATG (SEQ ID NO: 6) and its complement,
GATCTTGGCTGGATC (SEQ ID NO: 7) and its complement,
GATCTTGTCTGGATC (SEQ ID NO: 8) and its complement,
CCACCGCCCCACCCC (SEQ ID NO: 9) and its complement,
CCACCGCTCCACCCC (SEQ ID NO: 10) and its complement,
TCCCTGGGGGATCCT (SEQ ID NO: 11) and its complement,
TCCCTGGTGGATCCT (SEQ ID NO: 12) and its complement,
GCGTTAATCCTGGAG (SEQ ID NO: 13) and its complement,
GCGTTAACCCTGGAG (SEQ ID NO: 14) and its complement,
CCTGGGCCAGCCGAC (SEQ ID NO: 15) and its complement,
CCTGGGCTAGCCGAC (SEQ ID NO: 16) and its complement,
GGGAGAAGGCTCCAC (SEQ ID NO: 17) and its complement,
GGGAGAAAGCTCCAC (SEQ ID NO: 18) and its complement,
CCTTTTACGCTGCAA (SEQ ID NO: 19) and its complement,
CCTTTTAAGCTGCAA (SEQ ID NO: 20) and its complement,
GCTGGTACGTGTCAA (SEQ ID NO: 21) and its complement,
GCTGGTATGTGTCAA (SEQ ID NO: 22) and its complement,
AGGACCATTGCTCAG (SEQ ID NO: 23) and its complement,
AGGACCACTGCTCAG (SEQ ID NO: 24) and its complement,
AACATAATAGTAGCA (SEQ ID NO: 25) and its complement,
AACATAACAGTAGCA (SEQ ID NO: 26) and its complement,
TATTTTCCGTAGGAA (SEQ ID NO: 27) and its complement,
TATTTTCTGTAGGAA (SEQ ID NO: 28) and its complement,
CATTTTAGCATATTT (SEQ ID NO: 29) and its complement,
CATTTTAACATATTT (SEQ ID NO: 30) and its complement,
AAGTAAACGCAGAGA (SEQ ID NO: 31) and its complement,
AAGTAAATGCAGAGA (SEQ ID NO: 32) and its complement,
GTGTAGAGAGGATAA (SEQ ID NO: 33) and its complement,
GTGTAGAAAGGATAA (SEQ ID NO: 34) and its complement,
AGAGGATAAAACGGC (SEQ ID NO: 35) and its complement,
AGAGGATGAAACGGC (SEQ ID NO: 36) and its complement,
TGAAGTTATGGAAAC (SEQ ID NO: 37) and its complement,
TGAAGTTGTGGAAAC (SEQ ID NO: 38) and its complement,
TGAAGTTATGGAAAC (SEQ ID NO: 39) and its complement,
TGAAGTTGTGGAAAC (SEQ ID NO: 40) and its complement,
GTATGATAATCTAAA (SEQ ID NO: 41) and its complement,
GTATGATCATCTAAA (SEQ ID NO: 42) and its complement,
TAGTTACGGCAATTA (SEQ ID NO: 43) and its complement,
TAGTTACAGCAATTA (SEQ ID NO: 44) and its complement,
AGAGTGATGTGTCTT (SEQ ID NO: 45) and its complement,
AGAGTGACGTGTCTT (SEQ ID NO: 46) and its complement,
GGCATTGGATGAATAT (SEQ ID NO: 47) and its complement,
GGCATTGTATGAATAT (SEQ ID NO: 48) and its complement,
AAACAGCGTGCAGCG (SEQ ID NO: 49) and its complement,
AAACAGCATGCAGCG (SEQ ID NO: 50) and its complement,
AAAGAAGACCATCAG (SEQ ID NO: 51) and its complement,
AAAGAAGTCCATCAG (SEQ ID NO: 52) and its complement,
TCAGAAGTTATTTTT (SEQ ID NO: 53) and its complement, and
TCAGAAGCTATTTTT (SEQ ID NO: 54) and its complement.
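
As a non-limiting illustration of the probe design principle stated above, the following sketch locates the single position at which the two members of an ASO probe pair differ and checks that it lies near the center of the probe. The function names and the tolerance of one position on either side of the midpoint are assumptions made for this example, chosen to reproduce the "7th or 8th position in a 15mer" guidance; they are not definitions from the specification.

```python
def polymorphic_position(allele_a: str, allele_b: str) -> int:
    """Return the 1-based position at which two equal-length probes differ.

    Raises ValueError unless the probes differ at exactly one position.
    """
    if len(allele_a) != len(allele_b):
        raise ValueError("probes must have equal length")
    diffs = [i + 1 for i, (a, b) in enumerate(zip(allele_a, allele_b)) if a != b]
    if len(diffs) != 1:
        raise ValueError("probes must differ at exactly one position")
    return diffs[0]


def is_central(position: int, length: int) -> bool:
    """True if `position` (1-based) is within one nucleotide of the midpoint.

    The one-nucleotide tolerance is an assumption for illustration.
    """
    return abs(2 * position - (length + 1)) <= 2


# SEQ ID NO: 5 and SEQ ID NO: 6 differ at a single, central position.
site = polymorphic_position("ACTTGAAGATGAATG", "ACTTGAAAATGAATG")
print(site, is_central(site, 15))
```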



An allele-specific oligonucleotide primer of the invention has a 3' terminal nucleotide, or
preferably a 3' penultimate nucleotide, that is complementary to only one nucleotide of a particular SNP,
thereby acting as a primer for polymerase-mediated extension only if the allele containing that nucleotide
is present. Allele-specific oligonucleotide primers hybridizing to either the coding or noncoding strand
are contemplated by the invention. A preferred ASO primer for detecting TNFRSF11B gene
polymorphisms comprises a nucleotide sequence, listed 5' to 3', selected from the group consisting of:



Accession No.: AB008821

ATGTAAACTTGAAGA (SEQ ID NO: 55); TTCGCAATCATTCATCT (SEQ ID NO: 56);
ATGTAAACTTGAAAA (SEQ ID NO: 57); TTCGCAATCATTCATTT (SEQ ID NO: 58);
CGTCCGGATCTTGGC (SEQ ID NO: 59); GAGTCCGATCCAGCC (SEQ ID NO: 60);
CGTCCGGATCTTGTC (SEQ ID NO: 61); GAGTCCGATCCAGAC (SEQ ID NO: 62);
CAGACACCACCGCCC (SEQ ID NO: 63); GCGTGAGGGGTGGGG (SEQ ID NO: 64);
CAGACACCACCGCTC (SEQ ID NO: 65); GCGTGAGGGGTGGAG (SEQ ID NO: 66);
CCCACCTCCCTGGGG (SEQ ID NO: 67); GCGGAAAGGATCCCC (SEQ ID NO: 68);
CCCACCTCCCTGGTG (SEQ ID NO: 69); GCGGAAAGGATCCAC (SEQ ID NO: 70);
CTGAAAGCGTTAATC (SEQ ID NO: 71); AGAAAGCTCCAGGAT (SEQ ID NO: 72);
CTGAAAGCGTTAACC (SEQ ID NO: 73); AGAAAGCTCCAGGGT (SEQ ID NO: 74);
TAAGTCCCTGGGCCA (SEQ ID NO: 75); GCACCCGTCGGCTGG (SEQ ID NO: 76);
TAAGTCCCTGGGCTA (SEQ ID NO: 77); GCACCCGTCGGCTAG (SEQ ID NO: 78);
CCGGCGGGGAGAAGG (SEQ ID NO: 79); GAGCGAGTGGAGCCT (SEQ ID NO: 80);
CCGGCGGGGAGAAAG (SEQ ID NO: 81); GAGCGAGTGGAGCTT (SEQ ID NO: 82);
GGGTGTCCTTTTACG (SEQ ID NO: 83); GGAACTTTGCAGCGT (SEQ ID NO: 84);
GGGTGTCCTTTTAAG (SEQ ID NO: 85); GGAACTTTGCAGCTT (SEQ ID NO: 86);
GTGCAAGCTGGTACG (SEQ ID NO: 87); TGCACATTGACACGT (SEQ ID NO: 88);
GTGCAAGCTGGTATG (SEQ ID NO: 89); TGCACATTGACACAT (SEQ ID NO: 90);
TTCCAAAGGACCATT (SEQ ID NO: 91); ATTCCTCTGAGCAAT (SEQ ID NO: 92);
TTCCAAAGGACCACT (SEQ ID NO: 93); ATTCCTCTGAGCAGT (SEQ ID NO: 94);
TTGTGCAACATAATA (SEQ ID NO: 95); TTTTACTGCTACTAT (SEQ ID NO: 96);
TTGTGCAACATAACA (SEQ ID NO: 97); TTTTACTGCTACTGT (SEQ ID NO: 98);
GCTTGGTATTTTCCG (SEQ ID NO: 99); CTGGGGTTCCTACGG (SEQ ID NO: 100);
GCTTGGTATTTTCTG (SEQ ID NO: 101); CTGGGGTTCCTACAG (SEQ ID NO: 102);
ACTTTGCATTTTAGC (SEQ ID NO: 103); AAGATAAAATATGCT (SEQ ID NO: 104);
ACTTTGCATTTTAAC (SEQ ID NO: 105); AAGATAAAATATGTT (SEQ ID NO: 106);
GCACCAAAGTAAACG (SEQ ID NO: 107); CTACACTCTCTGCGT (SEQ ID NO: 108);
GCACCAAAGTAAATG (SEQ ID NO: 109); CTACACTCTCTGCAT (SEQ ID NO: 110);
CAGAGAGTGTAGAGA (SEQ ID NO: 111); CGCGTTTTATCCT (SEQ ID NO: 112);
CAGAGAGTGTAGAAA (SEQ ID NO: 113); CGCGTTTTATCTT (SEQ ID NO: 114);
GTGTAGAGAGGATAA (SEQ ID NO: 115); TGTGTTGCCGTTTTA (SEQ ID NO: 116);
GTGTAGAGAGGATGA (SEQ ID NO: 117); TGTGTTGCCGTTTCA (SEQ ID NO: 118);
AGCTGCTGAAGTTAT (SEQ ID NO: 119); TTTGATGTTTCCATA (SEQ ID NO: 120);
AGCTGCTGAAGTTGT (SEQ ID NO: 121); TTTGATGTTTCCACA (SEQ ID NO: 122);
TCCAAGGTATGATAA (SEQ ID NO: 123); TTTTATTTTAGATTA (SEQ ID NO: 124);
TCCAAGGTATGATCA (SEQ ID NO: 125); TTTTATTTTAGATGA (SEQ ID NO: 126);
AAAGGCTAGTTACGG (SEQ ID NO: 127); TGATAAGTTAATTGC (SEQ ID NO: 128);
AAAGGCTAGTTACAG (SEQ ID NO: 129); TGATAAGTTAATTTC (SEQ ID NO: 130);
AAACTGAGAGTGATG (SEQ ID NO: 131); GAAAATAAGACACAT (SEQ ID NO: 132);
AAACTGAGAGTGACG (SEQ ID NO: 133); GAAAATAAGACACGT (SEQ ID NO: 134);
TATGATGGCATTGGA (SEQ ID NO: 135); CATTTATATTCATCC (SEQ ID NO: 136);
TATGATGGCATTGTA (SEQ ID NO: 137); CATTTATATTCATAC (SEQ ID NO: 138);
CTGTGAAAACAGCGT (SEQ ID NO: 139); ATGTGCCGCTGCACG (SEQ ID NO: 140);
(SEQ ID NO: 141); ATGTGCCGCTGCATG (SEQ ID NO: 142);
GAGTCTAAAGAAGAC (SEQ ID NO: 143); AGGAACCTGATGGTC (SEQ ID NO: 144);
GAGTCTAAAGAAGTC (SEQ ID NO: 145); AGGAACCTGATGGAC (SEQ ID NO: 146);
ATTGTATCAGAAGTT (SEQ ID NO: 147); ATTTCTAAAAATAAC (SEQ ID NO: 148);
ATTGTATCAGAAGCT (SEQ ID NO: 149); and ATTTCTAAAAATAGC (SEQ ID NO: 150).
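
Purely as an illustrative check of the ASO primer design described above, the sketch below reports how far from the 3' end the two members of an allele-specific primer pair differ (1 for the 3' terminal nucleotide, 2 for the 3' penultimate nucleotide). The function name is hypothetical and the sketch assumes the two primers are of equal length and differ at exactly one position.

```python
def discriminating_offset(primer_a: str, primer_b: str) -> int:
    """Return the distance from the 3' end of the single differing base.

    1 means the 3' terminal nucleotide, 2 the 3' penultimate nucleotide.
    Primers are given 5' to 3' and must differ at exactly one position.
    """
    if len(primer_a) != len(primer_b):
        raise ValueError("primers must have equal length")
    offsets = [
        len(primer_a) - i
        for i, (a, b) in enumerate(zip(primer_a, primer_b))
        if a != b
    ]
    if len(offsets) != 1:
        raise ValueError("primers must differ at exactly one position")
    return offsets[0]


# SEQ ID NO: 55 versus SEQ ID NO: 57, and SEQ ID NO: 56 versus SEQ ID NO: 58.
print(discriminating_offset("ATGTAAACTTGAAGA", "ATGTAAACTTGAAAA"))
print(discriminating_offset("TTCGCAATCATTCATCT", "TTCGCAATCATTCATTT"))
```

In both pairs shown, the allele-discriminating nucleotide is the 3' penultimate nucleotide, consistent with the preferred design.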



Other genotyping oligonucleotides of the invention hybridize to a target region located one to
several nucleotides downstream of one of the novel polymorphic sites identified herein. Such
oligonucleotides are useful in polymerase-mediated primer extension methods for detecting one of the
novel polymorphisms described herein and therefore such genotyping oligonucleotides are referred to
herein as "primer-extension oligonucleotides". In a preferred embodiment, the 3'-terminus of a
primer-extension oligonucleotide is a deoxynucleotide complementary to the nucleotide located immediately
adjacent to the polymorphic site. A particularly preferred oligonucleotide primer for detecting
TNFRSF11B gene polymorphisms by primer extension terminates in a nucleotide sequence, listed 5' to
3', selected from the group consisting of:

TAAACTTGAA (SEQ ID NO: 151); AATCATTCAT (SEQ ID NO: 152);
CCGGATCTTG (SEQ ID NO: 153); TCCGATCCAG (SEQ ID NO: 154);
ACACCACCGC (SEQ ID NO: 155); TGAGGGGTGG (SEQ ID NO: 156);
ACCTCCCTGG (SEQ ID NO: 157); GAAAGGATCC (SEQ ID NO: 158);
AAAGCGTTAA (SEQ ID NO: 159); AAGCTCCAGG (SEQ ID NO: 160);
GTCCCTGGGC (SEQ ID NO: 161); CCCGTCGGCT (SEQ ID NO: 162);
GCGGGGAGAA (SEQ ID NO: 163); CGAGTGGAGC (SEQ ID NO: 164);
TGTCCTTTTA (SEQ ID NO: 165); ACTTTGCAGC (SEQ ID NO: 166);
CAAGCTGGTA (SEQ ID NO: 167); ACATTGACAC (SEQ ID NO: 168);
CAAAGGACCA (SEQ ID NO: 169); CCTCTGAGCA (SEQ ID NO: 170);
TGCAACATAA (SEQ ID NO: 171); TACTGCTACT (SEQ ID NO: 172);
TGGTATTTTC (SEQ ID NO: 173); GGGTTCCTAC (SEQ ID NO: 174);
TTGCATTTTA (SEQ ID NO: 175); ATAAAATATG (SEQ ID NO: 176);
CCAAAGTAAA (SEQ ID NO: 177); CACTCTCTGC (SEQ ID NO: 178);
AGAGTGTAGA (SEQ ID NO: 179); GTTTTATCCT (SEQ ID NO: 180);
TAGAGAGGAT (SEQ ID NO: 181); GTTGCCGTTT (SEQ ID NO: 182);
TGCTGAAGTT (SEQ ID NO: 183); GATGTTTCCA (SEQ ID NO: 184);
AAGGTATGAT (SEQ ID NO: 185); TATTTTAGAT (SEQ ID NO: 186);
GGCTAGTTAC (SEQ ID NO: 187); AGTTAATTGC (SEQ ID NO: 188);
CTGAGAGTGA (SEQ ID NO: 189); AATAAGACAC (SEQ ID NO: 190);
GATGGCATTG (SEQ ID NO: 191); TTATATTCAT (SEQ ID NO: 192);
TGAAAACAGC (SEQ ID NO: 193); TGCCGCTGCA (SEQ ID NO: 194);
TCTAAAGAAG (SEQ ID NO: 195); AACCTGATGG (SEQ ID NO: 196);
GTATCAGAAG (SEQ ID NO: 197); and TCTAAAAATA (SEQ ID NO: 198).
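
The relationship between a primer-extension oligonucleotide and its polymorphic site may be illustrated, without limitation, by the following sketch. It assumes that the primer's 3' terminus pairs with the nucleotide immediately adjacent to the polymorphic site, so that on the strand being copied the primer sequence is the run of nucleotides immediately 5' of the site. The function name, the 0-based index convention and the default length of 10 nucleotides are choices made for this example only.

```python
def extension_primer(context: str, site_index: int, length: int = 10) -> str:
    """Return the `length` nucleotides immediately 5' of a polymorphic site.

    `context` is the sequence, written 5' to 3', of the strand whose sequence
    the primer shares; `site_index` is the 0-based index of the polymorphic
    nucleotide within `context`.
    """
    if site_index < length:
        raise ValueError("not enough sequence 5' of the polymorphic site")
    return context[site_index - length:site_index]


# In SEQ ID NO: 55 the polymorphic nucleotide is the 3' penultimate base
# (index 13); in SEQ ID NO: 56 it is at index 15.
print(extension_primer("ATGTAAACTTGAAGA", 13))
print(extension_primer("TTCGCAATCATTCATCT", 15))
```

Under these assumptions the two calls reproduce SEQ ID NO: 151 and SEQ ID NO: 152, respectively, so that the first nucleotide added by the polymerase corresponds to the polymorphic site.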



In some embodiments, a composition contains two or more differently labeled genotyping
oligonucleotides for simultaneously probing the identity of nucleotides at two or more polymorphic sites.
It is also contemplated that primer compositions may contain two or more sets of allele-specific primer
pairs to allow simultaneous targeting and amplification of two or more regions containing a polymorphic
site.

TNFRSF11B genotyping oligonucleotides of the invention may also be immobilized on or
synthesized on a solid surface such as a microchip, bead, or glass slide (see, e.g., WO 98/20020 and WO
98/20019). Such immobilized genotyping oligonucleotides may be used in a variety of polymorphism
detection assays, including but not limited to probe hybridization and polymerase extension assays.
Immobilized TNFRSF11B genotyping oligonucleotides of the invention may comprise an ordered array
of oligonucleotides designed to rapidly screen a DNA sample for polymorphisms in multiple genes at the
same time.

In another embodiment, the invention provides a kit comprising at least two genotyping
oligonucleotides packaged in separate containers. The kit may also contain other components such as
hybridization buffer (where the oligonucleotides are to be used as a probe) packaged in a separate
container. Alternatively, where the oligonucleotides are to be used to amplify a target region, the kit may
contain, packaged in separate containers, a polymerase and a reaction buffer optimized for primer
extension mediated by the polymerase, such as PCR.

The above described oligonucleotide compositions and kits are useful in methods for genotyping
and/or haplotyping the TNFRSF11B gene in an individual. As used herein, the terms "TNFRSF11B
genotype" and "TNFRSF11B haplotype" mean the genotype or haplotype contains the nucleotide pair or
nucleotide, respectively, that is present at one or more of the novel polymorphic sites described herein
and may optionally also include the nucleotide pair or nucleotide present at one or more additional
polymorphic sites in the TNFRSF11B gene. The additional polymorphic sites may be currently known
polymorphic sites or sites that are subsequently discovered.

One embodiment of the genotyping method involves isolating from the individual a nucleic acid
mixture comprising the two copies of the TNFRSF11B gene, or a fragment thereof, that are present in the
individual, and determining the identity of the nucleotide pair at one or more of the polymorphic sites
selected from PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15, PS16,
PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS25 in the two copies to assign a TNFRSF11B
genotype to the individual. As will be readily understood by the skilled artisan, the two "copies" of a
gene in an individual may be the same allele or may be different alleles. In a preferred embodiment of the
genotyping method, the identity of the nucleotide pair at PS6 is also determined. In a particularly
preferred embodiment, the genotyping method comprises determining the identity of the nucleotide pair
at each of PS1-25.

Typically, the nucleic acid mixture is isolated from a biological sample taken from the individual,
such as a blood sample or tissue sample. Suitable tissue samples include whole blood, semen, saliva,
tears, urine, fecal material, sweat, buccal, skin and hair. The nucleic acid mixture may be comprised of
genomic DNA, mRNA, or cDNA and, in the latter two cases, the biological sample must be obtained
from an organ in which the TNFRSF11B gene is expressed. Furthermore, it will be understood by the
skilled artisan that mRNA or cDNA preparations would not be used to detect polymorphisms located in
introns or in 5' and 3' nontranscribed regions. If a TNFRSF11B gene fragment is isolated, it must
contain the polymorphic site(s) to be genotyped.

One embodiment of the haplotyping method comprises isolating from the individual a nucleic
acid molecule containing only one of the two copies of the TNFRSF11B gene, or a fragment thereof, that
is present in the individual and determining in that copy the identity of the nucleotide at one or more of
the polymorphic sites PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15,
PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS25 in that copy to assign a
TNFRSF11B haplotype to the individual. The nucleic acid may be isolated using any method capable of
separating the two copies of the TNFRSF11B gene or fragment such as one of the methods described
above for preparing TNFRSF11B isogenes, with targeted in vivo cloning being the preferred approach.
As will be readily appreciated by those skilled in the art, any individual clone will only provide haplotype
information on one of the two TNFRSF11B gene copies present in an individual. If haplotype
information is desired for the individual's other copy, additional TNFRSF11B clones will need to be
examined. Typically, at least five clones should be examined to have more than a 90% probability of
haplotyping both copies of the TNFRSF11B gene in an individual. In some embodiments, the
haplotyping method also comprises identifying the nucleotide at PS6. In a particularly preferred
embodiment, the nucleotide at each of PS1-25 is identified.
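
The five-clone guideline stated above can be rationalized by a simple calculation, offered here only as a sketch. It assumes that each clone is derived independently and with equal probability from either of the two gene copies, an assumption not stated in the specification; under it, the probability that n clones include both copies is 1 - 2(1/2)^n. The function names are hypothetical.

```python
def prob_both_copies(n_clones: int) -> float:
    """Probability that n independent clones include both gene copies,
    assuming each clone comes from either copy with probability 1/2."""
    if n_clones < 1:
        raise ValueError("at least one clone is required")
    return 1.0 - 2.0 * 0.5 ** n_clones


def min_clones(target: float) -> int:
    """Smallest number of clones whose probability of covering both
    copies exceeds `target` (with 0 <= target < 1)."""
    n = 2
    while prob_both_copies(n) <= target:
        n += 1
    return n


print(prob_both_copies(4))  # 0.875
print(prob_both_copies(5))  # 0.9375
print(min_clones(0.90))     # 5
```

Under this assumption four clones give a probability of 87.5% and five clones give 93.75%, which agrees with the stated threshold of more than 90%.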

In a preferred embodiment, a TNFRSF11B haplotype pair is determined for an individual by
identifying the phased sequence of nucleotides at one or more of the polymorphic sites selected from PS1,
PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19,
PS20, PS21, PS22, PS23, PS24, and PS25 in each copy of the TNFRSF11B gene that is present in the
individual. In a particularly preferred embodiment, the haplotyping method comprises identifying the
phased sequence of nucleotides at each of PS1-25 in each copy of the TNFRSF11B gene. When
haplotyping both copies of the gene, the identifying step is preferably performed with each copy of the
gene being placed in separate containers. However, it is also envisioned that if the two copies are labeled
with different tags, or are otherwise separately distinguishable or identifiable, it could be possible in some
cases to perform the method in the same container. For example, if first and second copies of the gene
are labeled with different first and second fluorescent dyes, respectively, and an allele-specific
oligonucleotide labeled with yet a third different fluorescent dye is used to assay the polymorphic site(s),
then detecting a combination of the first and third dyes would identify the polymorphism in the first gene
copy while detecting a combination of the second and third dyes would identify the polymorphism in the
second gene copy.

In both the genotyping and haplotyping methods, the identity of a nucleotide (or nucleotide pair)
at a polymorphic site(s) may be determined by amplifying a target region(s) containing the polymorphic
site(s) directly from one or both copies of the TNFRSF11B gene, or fragment thereof, and determining the
sequence of the amplified region(s) by conventional methods. It will be readily appreciated by the skilled
artisan that only one nucleotide will be detected at a polymorphic site in individuals who are homozygous
at that site, while two different nucleotides will be detected if the individual is heterozygous for that site.
The polymorphism may be identified directly, known as positive-type identification, or by inference,
referred to as negative-type identification. For example, where a SNP is known to be guanine and
cytosine in a reference population, a site may be positively determined to be either guanine or cytosine for
an individual homozygous at that site, or both guanine and cytosine, if the individual is heterozygous at
that site. Alternatively, the site may be negatively determined to be not guanine (and thus
cytosine/cytosine) or not cytosine (and thus guanine/guanine).

WO 01/04137 PCT/US00/18803

In addition, the identity of the allele(s) present at any of the novel polymorphic sites described
herein may be indirectly determined by genotyping a polymorphic site not disclosed herein that is in
linkage disequilibrium with the polymorphic site that is of interest. Two sites are said to be in linkage
disequilibrium if the presence of a particular variant at one site enhances the predictability of another
variant at the second site (Stevens, JC 1999, Mol Diag. 4: 309-17). Polymorphic sites in linkage
disequilibrium with the presently disclosed polymorphic sites may be located in regions of the gene or in
other genomic regions not examined herein. Genotyping of a polymorphic site in linkage disequilibrium
with the novel polymorphic sites described herein may be performed by, but is not limited to, any of the
above-mentioned methods for detecting the identity of the allele at a polymorphic site.

The target region(s) may be amplified using any oligonucleotide-directed amplification method,
including but not limited to polymerase chain reaction (PCR) (U.S. Patent No. 4,965,188), ligase chain
reaction (LCR) (Barany et al., Proc. Natl. Acad. Sci. USA 88:189-193, 1991; WO90/01069), and
oligonucleotide ligation assay (OLA) (Landegren et al., Science 241:1077-1080, 1988). Oligonucleotides
useful as primers or probes in such methods should specifically hybridize to a region of the nucleic acid
that contains or is adjacent to the polymorphic site. Typically, the oligonucleotides are between 10 and
35 nucleotides in length and preferably, between 15 and 30 nucleotides in length. Most preferably, the
oligonucleotides are 20 to 25 nucleotides long. The exact length of the oligonucleotide will depend on
many factors that are routinely considered and practiced by the skilled artisan.

Other known nucleic acid amplification procedures may be used to amplify the target region,
including transcription-based amplification systems (U.S. Patent No. 5,130,238; EP 329,822; U.S. Patent
No. 5,169,766; WO89/06700) and isothermal methods (Walker et al., Proc. Natl. Acad. Sci. USA
89:392-396, 1992).

A polymorphism in the target region may also be assayed before or after amplification using one
of several hybridization-based methods known in the art. Typically, allele-specific oligonucleotides are
utilized in performing such methods. The allele-specific oligonucleotides may be used as differently
labeled probe pairs, with one member of the pair showing a perfect match to one variant of a target
sequence and the other member showing a perfect match to a different variant. In some embodiments,
more than one polymorphic site may be detected at once using a set of allele-specific oligonucleotides or
oligonucleotide pairs. Preferably, the members of the set have melting temperatures within 5°C, and
more preferably within 2°C, of each other when hybridizing to each of the polymorphic sites being
detected.

Hybridization of an allele-specific oligonucleotide to a target polynucleotide may be performed
with both entities in solution, or such hybridization may be performed when either the oligonucleotide or
the target polynucleotide is covalently or noncovalently affixed to a solid support. Attachment may be
mediated, for example, by antibody-antigen interactions, poly-L-Lys, streptavidin or avidin-biotin, salt
bridges, hydrophobic interactions, chemical linkages, UV cross-linking, baking, etc. Allele-specific
oligonucleotides may be synthesized directly on the solid support or attached to the solid support
subsequent to synthesis. Solid supports suitable for use in detection methods of the invention include
substrates made of silicon, glass, plastic, paper and the like, which may be formed, for example, into
wells (as in 96-well plates), slides, sheets, membranes, fibers, chips, dishes, and beads. The solid support
may be treated, coated or derivatized to facilitate the immobilization of the allele-specific oligonucleotide
or target nucleic acid.
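
The preference that members of a set of allele-specific oligonucleotides have melting temperatures within 5°C, or more preferably within 2°C, of each other may be screened with a rough estimate such as the following sketch. It uses the Wallace rule of thumb, Tm = 2(A+T) + 4(G+C) in °C, which is an approximation for short oligonucleotides and is an assumption of this example, not a method named in the specification; the function names are likewise hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Approximate melting temperature (deg C) by the Wallace rule:
    2 deg C per A or T plus 4 deg C per G or C."""
    s = oligo.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc


def tm_spread(oligos) -> int:
    """Difference between the highest and lowest estimated Tm in a set."""
    tms = [wallace_tm(o) for o in oligos]
    return max(tms) - min(tms)


def melting_temperatures_match(oligos, tolerance_c: float = 5.0) -> bool:
    """True if all estimated Tm values lie within `tolerance_c` of each other."""
    return tm_spread(oligos) <= tolerance_c


pair = ["ACTTGAAGATGAATG", "ACTTGAAAATGAATG"]  # SEQ ID NO: 5 and 6
print([wallace_tm(p) for p in pair], melting_temperatures_match(pair))
```

A more accurate design would use nearest-neighbor thermodynamics and the actual salt and formamide concentrations; the sketch is intended only to show the form of the comparison.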

The genotype or haplotype for the TNFRSF11B gene of an individual may also be determined by
hybridization of a nucleic acid sample containing one or both copies of the gene to nucleic acid arrays and
subarrays such as described in WO 95/11995. The arrays would contain a battery of allele-specific
oligonucleotides representing each of the polymorphic sites to be included in the genotype or haplotype.

The identity of polymorphisms may also be determined using a mismatch detection technique,
including but not limited to the RNase protection method using riboprobes (Winter et al., Proc. Natl.
Acad. Sci. USA 82:7575, 1985; Meyers et al., Science 230:1242, 1985) and proteins which recognize
nucleotide mismatches, such as the E. coli mutS protein (Modrich, P. Ann. Rev. Genet. 25:229-253,
1991). Alternatively, variant alleles can be identified by single strand conformation polymorphism
(SSCP) analysis (Orita et al., Genomics 5:874-879, 1989; Humphries et al., in Molecular Diagnosis of
Genetic Diseases, R. Elles, ed., pp. 321-340, 1996) or denaturing gradient gel electrophoresis (DGGE)
(Wartell et al., Nucl. Acids Res. 18:2699-2706, 1990; Sheffield et al., Proc. Natl. Acad. Sci. USA
86:232-236, 1989).

A polymerase-mediated primer extension method may also be used to identify the
polymorphism(s). Several such methods have been described in the patent and scientific literature and
include the "Genetic Bit Analysis" method (WO92/15712) and the ligase/polymerase mediated genetic bit
analysis (U.S. Patent 5,679,524). Related methods are disclosed in WO91/02087, WO90/09455,
WO95/17676, U.S. Patent Nos. 5,302,509, and 5,945,283. Extended primers containing a polymorphism
may be detected by mass spectrometry as described in U.S. Patent No. 5,605,798. Another primer
extension method is allele-specific PCR (Ruano et al., Nucl. Acids Res. 17:8392, 1989; Ruano et al.,
Nucl. Acids Res. 19, 6877-6882, 1991; WO 93/22456; Turki et al., J. Clin. Invest. 95:1635-1641, 1995).
In addition, multiple polymorphic sites may be investigated by simultaneously amplifying multiple
regions of the nucleic acid using sets of allele-specific primers as described in Wallace et al.
(WO89/10414).

In another aspect of the invention, an individual's TNFRSF11B haplotype pair is predicted from
its TNFRSF11B genotype using information on haplotype pairs known to exist in a reference population.
In its broadest embodiment, the haplotyping prediction method comprises identifying a TNFRSF11B
genotype for the individual at two or more polymorphic sites selected from PS1, PS2, PS3, PS4, PS5,
PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22,
PS23, PS24, and PS25, enumerating all possible haplotype pairs which are consistent with the genotype,
accessing data containing TNFRSF11B haplotype pairs identified in a reference population, and assigning
a haplotype pair to the individual that is consistent with the data. In one embodiment, the reference
haplotype pairs include the TNFRSF11B haplotype pairs shown in Table 4.
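
The enumerating and assigning steps described above may be sketched, by way of example only, as follows. The three-site genotype and the reference haplotype pairs used here are invented for illustration and are not the data of Table 4; the function names and the representation of a haplotype as a string with one nucleotide per site are likewise assumptions of this sketch.

```python
from itertools import product


def possible_haplotype_pairs(genotype):
    """Enumerate all unordered haplotype pairs consistent with an unphased
    genotype, given as a sequence of (nucleotide, nucleotide) per site."""
    pairs = set()
    # At a heterozygous site either nucleotide may lie on the first copy.
    options = [((a, b), (b, a)) if a != b else ((a, b),) for a, b in genotype]
    for choice in product(*options):
        hap1 = "".join(site[0] for site in choice)
        hap2 = "".join(site[1] for site in choice)
        pairs.add(tuple(sorted((hap1, hap2))))
    return pairs


def assign_haplotype_pair(genotype, reference_pairs):
    """Return the candidate pairs that also occur in the reference data."""
    reference = {tuple(sorted(pair)) for pair in reference_pairs}
    return sorted(possible_haplotype_pairs(genotype) & reference)


# Hypothetical genotype: heterozygous at sites 1 and 3, homozygous at site 2.
genotype = [("G", "A"), ("C", "C"), ("T", "G")]
reference = [("GCT", "ACG"), ("GCT", "GCT")]  # invented reference pairs
print(sorted(possible_haplotype_pairs(genotype)))
print(assign_haplotype_pair(genotype, reference))
```

A genotype that is heterozygous at k sites yields 2^(k-1) candidate pairs, two in this example, of which only the pair found in the reference data would be assigned.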

Generally, the reference population should be composed of randomly-selected individuals
representing the major ethnogeographic groups of the world. A preferred reference population for use in
the methods of the present invention comprises an approximately equal number of individuals from
Caucasian, African American, Asian and Hispanic-Latino population groups with the minimum number
of each group being chosen based on how rare a haplotype one wants to be guaranteed to see. For
example, if one wants to have a q% chance of not missing a haplotype that exists in the population at a
p% frequency of occurring in the reference population, the number of individuals (n) who must be
sampled is given by 2n = log(1-q)/log(1-p) where p and q are expressed as fractions. A preferred reference
population allows the detection of any haplotype whose frequency is at least 10% with about 99%
certainty and comprises about 20 unrelated individuals from each of the four population groups named
above. A particularly preferred reference population includes a 3-generation family representing one or
more of the four population groups to serve as controls for checking quality of haplotyping procedures.
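
The sample-size relation 2n = log(1-q)/log(1-p) given above can be evaluated as in the following sketch, in which the function names are hypothetical and p and q are fractions. The sketch assumes, as the relation itself implies, that each individual contributes two independently sampled gene copies.

```python
import math


def individuals_needed(p: float, q: float) -> int:
    """Smallest whole number of individuals n satisfying
    2n >= log(1 - q) / log(1 - p), with p and q given as fractions."""
    chromosomes = math.log(1.0 - q) / math.log(1.0 - p)
    return math.ceil(chromosomes / 2.0)


def chance_of_detection(p: float, n: int) -> float:
    """Probability that a haplotype of frequency p appears at least once
    among the 2n gene copies carried by n individuals."""
    return 1.0 - (1.0 - p) ** (2 * n)


print(individuals_needed(0.10, 0.99))            # 22
print(round(chance_of_detection(0.10, 20), 3))   # 0.985
```

For p = 0.10 and q = 0.99 the relation gives 2n of about 43.7, that is, 22 individuals, and a group of 20 individuals detects such a haplotype with a probability of about 98.5%, in line with the stated figures of about 20 individuals and about 99% certainty.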

In a preferred embodiment, the haplotype frequency data for each ethnogeographic group is
examined to determine whether it is consistent with Hardy-Weinberg equilibrium. Hardy-Weinberg
equilibrium (D.L. Hartl et al., Principles of Population Genetics, Sinauer Associates (Sunderland, MA),
3rd Ed., 1997) postulates that the frequency of finding the haplotype pair $H_1/H_2$ is equal to
$P_{H\text{-}W}(H_1/H_2) = 2p(H_1)p(H_2)$ if $H_1 \neq H_2$ and $P_{H\text{-}W}(H_1/H_2) = p(H_1)p(H_2)$ if $H_1 = H_2$. A
statistically significant difference between the observed and expected haplotype frequencies could be due
to one or more factors including significant inbreeding in the population group, strong selective pressure
on the gene, sampling bias, and/or errors in the genotyping process. If large deviations from
Hardy-Weinberg equilibrium are observed in an ethnogeographic group, the number of individuals in that group
can be increased to see if the deviation is due to a sampling bias. If a larger sample size does not reduce
the difference between observed and expected haplotype pair frequencies, then one may wish to consider
haplotyping the individual using a direct haplotyping method such as, for example, CLASPER System™
technology (U.S. Patent No. 5,866,404), SMD, or allele-specific long-range PCR (Michalatos-Beloin et
al., Nucleic Acids Res. 24:4841-4843, 1996).
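
As an illustration of the Hardy-Weinberg comparison, the sketch below computes the expected frequency of each haplotype pair (2p(H1)p(H2) for distinct haplotypes, p(H1)p(H2) for identical ones) and a chi-square statistic against observed pair counts. The function names and the choice of a chi-square statistic are assumptions made for this example; the specification does not prescribe a particular test.

```python
from collections import Counter
from itertools import combinations_with_replacement


def expected_pair_frequencies(hap_freq):
    """Hardy-Weinberg expectation for every unordered haplotype pair."""
    expected = {}
    for h1, h2 in combinations_with_replacement(sorted(hap_freq), 2):
        factor = 1.0 if h1 == h2 else 2.0  # 2pq for distinct, p^2 for identical
        expected[(h1, h2)] = factor * hap_freq[h1] * hap_freq[h2]
    return expected


def chi_square(observed_pairs):
    """Chi-square statistic of observed pair counts versus HWE expectation.

    observed_pairs: one (haplotype, haplotype) tuple per individual.
    """
    n = len(observed_pairs)
    pair_counts = Counter(tuple(sorted(pair)) for pair in observed_pairs)
    hap_counts = Counter(h for pair in observed_pairs for h in pair)
    hap_freq = {h: c / (2 * n) for h, c in hap_counts.items()}
    stat = 0.0
    for pair, freq in expected_pair_frequencies(hap_freq).items():
        expected_count = freq * n
        stat += (pair_counts.get(pair, 0) - expected_count) ** 2 / expected_count
    return stat


if __name__ == "__main__":
    # Hypothetical group of 100 individuals in exact equilibrium.
    sample = [("H1", "H1")] * 25 + [("H1", "H2")] * 50 + [("H2", "H2")] * 25
    print(chi_square(sample))
```

A statistic near zero indicates agreement with equilibrium; the threshold for a "large deviation" would be taken from a chi-square table at the significance level of interest.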

In one embodiment of this method for predicting a TNFRSF11B haplotype pair, the assigning
step involves performing the following analysis. First, each of the possible haplotype pairs is compared
to the haplotype pairs in the reference population. Generally, only one of the haplotype pairs in the
reference population matches a possible haplotype pair and that pair is assigned to the individual.
Occasionally, only one haplotype represented in the reference haplotype pairs is consistent with a possible
haplotype pair for an individual, and in such cases the individual is assigned a haplotype pair containing
this known haplotype and a new haplotype derived by subtracting the known haplotype from the possible
haplotype pair. In rare cases, either no haplotypes in the reference population are consistent with the
possible haplotype pairs, or alternatively, multiple reference haplotype pairs are consistent with the
possible haplotype pairs. In such cases, the individual is preferably haplotyped using a direct molecular
haplotyping method such as, for example, CLASPER System™ technology (U.S. Patent No. 5,866,404),
SMD, or allele-specific long-range PCR (Michalatos-Beloin et al., Nucleic Acids Res. 24:4841-4843,
1996).
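
The enumerating and assigning steps described above can be sketched as follows. This is a simplified illustration under stated assumptions: genotypes are represented as a list of allele pairs (one pair per polymorphic site), haplotypes as strings, and the names `possible_pairs` and `assign_pair` are hypothetical. Returning `None` stands in for the cases that the text refers to direct molecular haplotyping.

```python
from itertools import product


def possible_pairs(genotype):
    """Enumerate unordered haplotype pairs consistent with an unphased genotype."""
    pairs = set()
    for phase in product((0, 1), repeat=len(genotype)):
        h1 = "".join(site[p] for site, p in zip(genotype, phase))
        h2 = "".join(site[1 - p] for site, p in zip(genotype, phase))
        pairs.add(tuple(sorted((h1, h2))))
    return pairs


def assign_pair(genotype, reference_pairs):
    """Assign a haplotype pair using reference data, or None if unresolved."""
    candidates = possible_pairs(genotype)
    reference = {tuple(sorted(pair)) for pair in reference_pairs}
    matches = candidates & reference
    if len(matches) == 1:          # usual case: exactly one reference pair fits
        return matches.pop()
    if len(matches) > 1:           # ambiguous: defer to direct haplotyping
        return None
    # No full pair matches: look for one known haplotype plus a new partner.
    known = {h for pair in reference for h in pair}
    partial = [pair for pair in candidates if pair[0] in known or pair[1] in known]
    if len(partial) == 1:
        return partial[0]
    return None                    # nothing consistent, or still ambiguous


if __name__ == "__main__":
    # Hypothetical three-site genotype, heterozygous at sites 1 and 3.
    genotype = [("C", "T"), ("G", "G"), ("A", "G")]
    print(assign_pair(genotype, [("CGA", "TGG"), ("CGA", "CGA")]))
```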

The invention also provides a method for determining the frequency of a TNFRSF11B genotype
or TNFRSF11B haplotype in a population. The method comprises determining the genotype or the
haplotype pair for the TNFRSF11B gene that is present in each member of the population, wherein the
genotype or haplotype comprises the nucleotide pair or nucleotide detected at one or more of the
polymorphic sites PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15, PS16,
PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS25 in the TNFRSF11B gene; and calculating
the frequency with which any particular genotype or haplotype is found in the population. The population may be a
reference population, a family population, a same sex population, a population group, or a trait population
(e.g., a group of individuals exhibiting a trait of interest such as a medical condition or response to a
therapeutic treatment).
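
A minimal sketch of the frequency calculation is given below. It assumes each member of the population is represented by a haplotype pair and that frequency is counted per chromosome; the function name is hypothetical.

```python
from collections import Counter


def haplotype_frequencies(haplotype_pairs):
    """Per-chromosome frequency of each haplotype in a population.

    haplotype_pairs: one (haplotype, haplotype) tuple per individual.
    """
    counts = Counter(h for pair in haplotype_pairs for h in pair)
    total = 2 * len(haplotype_pairs)  # two chromosomes per individual
    return {h: c / total for h, c in counts.items()}


if __name__ == "__main__":
    # Hypothetical population of two individuals.
    print(haplotype_frequencies([("H1", "H2"), ("H1", "H1")]))
```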

In another aspect of the invention, frequency data for TNFRSF11B genotypes and/or haplotypes
found in a reference population are used in a method for identifying an association between a trait and a
TNFRSF11B genotype or a TNFRSF11B haplotype. The trait may be any detectable phenotype,
including but not limited to susceptibility to a disease or response to a treatment. The method involves
obtaining data on the frequency of the genotype(s) or haplotype(s) of interest in a reference population as
well as in a population exhibiting the trait. Frequency data for one or both of the reference and trait
populations may be obtained by genotyping or haplotyping each individual in the populations using one
of the methods described above. The haplotypes for the trait population may be determined directly or,
alternatively, by the predictive genotype to haplotype approach described above. In another embodiment,
the frequency data for the reference and/or trait populations is obtained by accessing previously
determined frequency data, which may be in written or electronic form. For example, the frequency data
may be present in a database that is accessible by a computer. Once the frequency data is obtained, the
frequencies of the genotype(s) or haplotype(s) of interest in the reference and trait populations are
compared. In a preferred embodiment, the frequencies of all genotypes and/or haplotypes observed in the
populations are compared. If a particular genotype or haplotype for the TNFRSF11B gene is more
frequent in the trait population than in the reference population by a statistically significant amount, then
the trait is predicted to be associated with that TNFRSF11B genotype or haplotype. Preferably, the
TNFRSF11B genotype or haplotype being compared in the trait and reference populations is selected
from the full-genotypes and full-haplotypes shown in Tables 4 and 5, respectively, or from sub-genotypes
and sub-haplotypes derived from these genotypes and haplotypes.
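
One way to judge whether a haplotype is more frequent in the trait population by a statistically significant amount is a one-sided Fisher exact test on carrier counts. The sketch below is illustrative only: the specification does not name a particular test, and the function name, the use of carrier counts, and the example numbers are assumptions.

```python
from math import comb


def fisher_one_sided(trait_with, trait_total, ref_with, ref_total):
    """One-sided Fisher exact p-value for enrichment in the trait population.

    Returns the probability, under the hypergeometric null, of seeing at least
    `trait_with` carriers among `trait_total` trait individuals.
    """
    total = trait_total + ref_total
    carriers = trait_with + ref_with
    denominator = comb(total, trait_total)
    upper = min(carriers, trait_total)
    numerator = 0
    for k in range(trait_with, upper + 1):
        numerator += comb(carriers, k) * comb(total - carriers, trait_total - k)
    return numerator / denominator


if __name__ == "__main__":
    # Hypothetical counts: 9 of 10 trait individuals carry the haplotype,
    # versus 2 of 10 reference individuals.
    p_value = fisher_one_sided(9, 10, 2, 10)
    print(p_value, p_value < 0.05)
```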

In a preferred embodiment of the method, the trait of interest is a clinical response exhibited by a
patient to some therapeutic treatment, for example, response to a drug targeting TNFRSF11B or response
to a therapeutic treatment for a medical condition. As used herein, "medical condition" includes but is
not limited to any condition or disease manifested as one or more physical and/or psychological
symptoms for which treatment is desirable, and includes previously and newly identified diseases and
other disorders. As used herein the term "clinical response" means any or all of the following: a
quantitative measure of the response, no response, and adverse response (i.e., side effects).

In order to deduce a correlation between clinical response to a treatment and a TNFRSF11B
genotype or haplotype, it is necessary to obtain data on the clinical responses exhibited by a population of
individuals who received the treatment, hereinafter the "clinical population". This clinical data may be
obtained by analyzing the results of a clinical trial that has already been run and/or the clinical data may
be obtained by designing and carrying out one or more new clinical trials. As used herein, the term
"clinical trial" means any research study designed to collect clinical data on responses to a particular
treatment, and includes but is not limited to phase I, phase II and phase III clinical trials. Standard
methods are used to define the patient population and to enroll subjects.

It is preferred that the individuals included in the clinical population have been graded for the
existence of the medical condition of interest. This is important in cases where the symptom(s) being
presented by the patients can be caused by more than one underlying condition, and where treatment of
the underlying conditions is not the same. An example of this would be where patients experience
breathing difficulties that are due to either asthma or respiratory infections. If both sets were treated with
an asthma medication, there would be a spurious group of apparent non-responders that did not actually
have asthma. These people would affect the ability to detect any correlation between haplotype and
treatment outcome. This grading of potential patients could employ a standard physical exam or one or
more lab tests. Alternatively, grading of patients could use haplotyping for situations where there is a
strong correlation between haplotype pair and disease susceptibility or severity.

The therapeutic treatment of interest is administered to each individual in the trial population and
each individual's response to the treatment is measured using one or more predetermined criteria. It is
contemplated that in many cases, the trial population will exhibit a range of responses and that the
investigator will choose the number of responder groups (e.g., low, medium, high) made up by the
various responses. In addition, the TNFRSF11B gene for each individual in the trial population is
genotyped and/or haplotyped, which may be done before or after administering the treatment.

After both the clinical and polymorphism data have been obtained, correlations between
individual response and TNFRSF11B genotype or haplotype content are determined. Correlations may be
produced in several ways. In one method, individuals are grouped by their TNFRSF11B genotype or
haplotype (or haplotype pair) (also referred to as a polymorphism group), and then the averages and
standard deviations of clinical responses exhibited by the members of each polymorphism group are
calculated.
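
The grouping step can be sketched as follows, assuming each record pairs an individual's haplotype pair with a numeric clinical response. The function name and the convention of reporting a standard deviation of 0.0 for a single-member group are assumptions made for this example.

```python
from collections import defaultdict
from statistics import mean, stdev


def summarize_by_group(records):
    """Mean and sample standard deviation of response per polymorphism group.

    records: iterable of ((haplotype, haplotype), response) tuples.
    """
    groups = defaultdict(list)
    for pair, response in records:
        groups[tuple(sorted(pair))].append(response)  # pair order is irrelevant
    return {
        group: (mean(values), stdev(values) if len(values) > 1 else 0.0)
        for group, values in groups.items()
    }


if __name__ == "__main__":
    # Hypothetical responses for three individuals.
    data = [(("H1", "H2"), 10.0), (("H2", "H1"), 14.0), (("H1", "H1"), 5.0)]
    print(summarize_by_group(data))
```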

These results are then analyzed to determine if any observed variation in clinical response
between polymorphism groups is statistically significant. Statistical analysis methods which may be used
are described in L.D. Fisher and G. vanBelle, "Biostatistics: A Methodology for the Health Sciences",
Wiley-Interscience (New York) 1993. This analysis may also include a regression calculation of which
polymorphic sites in the TNFRSF11B gene give the most significant contribution to the differences in
phenotype. One regression model useful in the invention starts with a model of the form

$r = r_0 + (S \times d)$

where r is the response, $r_0$ is a constant called the "intercept", S is the slope and d is the dose. To
determine the dose, the most-common and least-common nucleotides at the polymorphic site are first
defined. Then, for each individual in the trial population, one calculates a "dose" as the number of
least-common nucleotides the individual has at the polymorphic site of interest. This value can be 0
(homozygous for the most-common nucleotide), 1 (heterozygous), or 2 (homozygous for the
least-common nucleotide). An individual's "response" is the value of the clinical measurement. Standard
linear regression methods are then used to fit all the individuals' doses and responses to a single model
(see e.g., L.D. Fisher and G. vanBelle, supra, Ch. 9). The outputs of the regression calculation are the
intercept $r_0$, the slope S, and the variance (which measures how well the data fits this simple linear
model). The Student's t-test value and the level of significance can then be calculated for each of the
polymorphic sites.
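
The dose-response regression can be illustrated with an ordinary least-squares fit of the linear model r = r0 + S × d, where the dose d is the number of least-common nucleotides (0, 1, or 2) at the site. The sketch below is an assumption-laden example rather than the procedure of the cited text: the function names are hypothetical, the variance is taken as the residual mean square, and the t value is the slope divided by its standard error.

```python
import math


def dose(genotype, least_common):
    """Number of least-common nucleotides (0, 1 or 2) at one polymorphic site."""
    return sum(1 for allele in genotype if allele == least_common)


def fit_dose_response(doses, responses):
    """Least-squares fit of r = r0 + S * d.

    Returns (intercept r0, slope S, residual variance, t value of the slope).
    """
    n = len(doses)
    mean_d = sum(doses) / n
    mean_r = sum(responses) / n
    sxx = sum((d - mean_d) ** 2 for d in doses)
    sxy = sum((d - mean_d) * (r - mean_r) for d, r in zip(doses, responses))
    slope = sxy / sxx
    intercept = mean_r - slope * mean_d
    residual_ss = sum(
        (r - (intercept + slope * d)) ** 2 for d, r in zip(doses, responses)
    )
    variance = residual_ss / (n - 2)
    t_value = slope / math.sqrt(variance / sxx) if variance > 0 else math.inf
    return intercept, slope, variance, t_value


if __name__ == "__main__":
    # Hypothetical site where T is the least-common nucleotide.
    genotypes = [("C", "C"), ("C", "T"), ("T", "T"), ("C", "C"), ("C", "T"), ("T", "T")]
    responses = [1.1, 2.9, 5.2, 0.9, 3.1, 4.8]
    doses = [dose(g, "T") for g in genotypes]
    print(fit_dose_response(doses, responses))
```

The level of significance would then be read from a Student's t distribution with n - 2 degrees of freedom.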

A second method for finding correlations between TNFRSF11B haplotype content and clinical
responses uses predictive models based on error-minimizing optimization algorithms. One of many
possible optimization algorithms is a genetic algorithm (R. Judson, "Genetic Algorithms and Their Uses
in Chemistry" in Reviews in Computational Chemistry, Vol. 10, pp. 1-73, K. B. Lipkowitz and D. B.
Boyd, eds. (VCH Publishers, New York, 1997)). Simulated annealing (Press et al., "Numerical Recipes in
C: The Art of Scientific Computing", Cambridge University Press (Cambridge) 1992, Ch. 10), neural
networks (E. Rich and K. Knight, "Artificial Intelligence", 2nd Edition (McGraw-Hill, New York, 1991),
Ch. 18), standard gradient descent methods (Press et al., supra, Ch. 10), or other global or local
optimization approaches (see discussion in Judson, supra) could also be used. As an example, a genetic
algorithm approach is described herein. This method searches for optimal parameters or weights in linear
or non-linear models connecting TNFRSF11B haplotype loci and clinical outcome. One model is of the
form

$$C = C_0 + \sum_{\alpha} \sum_{i} \left( w_{i,\alpha} R_{i,\alpha} + W_{i,\alpha} L_{i,\alpha} \right) \qquad \text{(Equation 1)}$$

where C is the measured clinical outcome, i goes over all polymorphic sites, α over all candidate genes,
$C_0$, $w_{i,\alpha}$ and $W_{i,\alpha}$ are variable weight values, and $R_{i,\alpha}$ is equal to 1 if site i in gene α in the first haplotype
takes on the most common nucleotide and -1 if it takes on the less common nucleotide. $L_{i,\alpha}$ is the same
as $R_{i,\alpha}$ except for the second haplotype. The constant term $C_0$ and the weights $w_{i,\alpha}$ and $W_{i,\alpha}$ are varied
by the genetic algorithm during a search process that minimizes the error between the measured value of
C and the value calculated from Equation 1. Models other than the one given in Equation 1 can be readily
incorporated by those skilled in the art for analyzing the clinical and polymorphism data. The genetic
algorithm is especially suited for searching not only over the space of weights in a particular model but
also over the space of possible models (Judson, supra).
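
The weight search can be illustrated with a very small genetic algorithm. The sketch below is not the algorithm of the cited Judson reference; it assumes a single gene, a linear model in which the outcome equals a constant plus a weighted sum of the R and L site codes (+1 or -1), a sum-of-squares error, and simple truncation selection with uniform crossover, Gaussian mutation, and elitism. All names and parameter values are hypothetical.

```python
import random


def predict(params, r, l):
    """C0 plus the weighted sum over sites of w_i * R_i + W_i * L_i."""
    n = len(r)
    c0, w, big_w = params[0], params[1:1 + n], params[1 + n:]
    return c0 + sum(wi * ri + Wi * li for wi, ri, Wi, li in zip(w, r, big_w, l))


def error(params, data):
    """Sum of squared differences between measured and calculated outcome."""
    return sum((c - predict(params, r, l)) ** 2 for r, l, c in data)


def genetic_search(data, n_sites, pop_size=40, generations=200, seed=0):
    """Search for C0 and the weights; returns (best parameters, error history)."""
    rng = random.Random(seed)
    size = 1 + 2 * n_sites
    population = [[rng.uniform(-1, 1) for _ in range(size)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=lambda p: error(p, data))
        history.append(error(population[0], data))
        parents = population[: pop_size // 2]      # truncation selection
        children = [population[0][:]]              # elitism: keep the best as-is
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [x if rng.random() < 0.5 else y for x, y in zip(a, b)]
            child = [g + rng.gauss(0, 0.1) if rng.random() < 0.2 else g for g in child]
            children.append(child)
        population = children
    best = min(population, key=lambda p: error(p, data))
    return best, history


if __name__ == "__main__":
    # Hypothetical data: (R codes, L codes, measured outcome) for two sites.
    data = [
        ((1, -1), (1, 1), 3.0),
        ((-1, -1), (1, -1), 1.0),
        ((1, 1), (-1, 1), 2.0),
        ((-1, 1), (-1, -1), 0.5),
    ]
    best, history = genetic_search(data, n_sites=2)
    print(history[0], error(best, data))
```

Because the best individual is carried over unchanged in each generation, the recorded error never increases from one generation to the next.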

Correlations may also be analyzed using analysis of variance (ANOVA) techniques to determine
how much of the variation in the clinical data is explained by different subsets of the polymorphic sites in
the TNFRSF11B gene. ANOVA is used to test hypotheses about whether a response variable is caused
by or correlated with one or more traits or variables that can be measured (Fisher and vanBelle, supra, Ch.
10). These traits or variables are called the independent variables. To carry out ANOVA, the
independent variable(s) are measured and individuals are placed into groups based on their values for
these variables. In this case, the independent variable(s) refers to the combination of polymorphisms
present at a subset of the polymorphic sites, and thus, each group contains those individuals with a given
genotype or haplotype pair. The variation in response within the groups and also the variation between
groups is then measured. If the within-group response variation is large (people in a group have a wide
range of responses) and the response variation between groups is small (the average responses for all
groups are about the same) then it can be concluded that the independent variables used for the grouping
are not causing or correlated with the response variable. For instance, if people are grouped by month of
birth (which should have nothing to do with their response to a drug) the ANOVA calculation should
show a low level of significance. However, if the response variation is larger between groups than within
groups, the F-ratio (= "between groups" divided by "within groups") is greater than one. Large values of
the F-ratio indicate that the independent variable is causing or correlated with the response. The
calculated F-ratio is preferably compared with the critical F-distribution value at whatever level of
significance is of interest. If the F-ratio is greater than the critical F-distribution value, then one may be
confident that the individual's genotype or haplotype pair for this particular subset of polymorphic sites in
the TNFRSF11B gene is at least partially responsible for, or is at least strongly correlated with, the clinical
response.
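
The F-ratio described above can be computed as the between-group mean square divided by the within-group mean square. The following one-way ANOVA sketch assumes each group is a list of numeric responses for individuals sharing a genotype or haplotype pair; the function name and example values are hypothetical.

```python
def f_ratio(groups):
    """One-way ANOVA F-ratio: between-group over within-group mean square."""
    k = len(groups)                              # number of polymorphism groups
    n = sum(len(g) for g in groups)              # total number of individuals
    grand_mean = sum(sum(g) for g in groups) / n
    between_ss = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    within_ss = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)
    between_ms = between_ss / (k - 1)
    within_ms = within_ss / (n - k)
    return between_ms / within_ms


if __name__ == "__main__":
    # Hypothetical responses for three haplotype-pair groups.
    print(f_ratio([[1, 2, 3], [2, 3, 4], [6, 7, 8]]))
```

The resulting value would be compared with the critical F value for (k - 1, n - k) degrees of freedom at the chosen significance level.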

From the analyses described above, a mathematical model may be readily constructed by the
skilled artisan that predicts clinical response as a function of TNFRSF11B genotype or haplotype content.
Preferably, the model is validated in one or more follow-up clinical trials designed to test the model.

The identification of an association between a clinical response and a genotype or haplotype (or
haplotype pair) for the TNFRSF11B gene may be the basis for designing a diagnostic method to
determine those individuals who will or will not respond to the treatment, or alternatively, will respond at
a lower level and thus may require more treatment, i.e., a greater dose of a drug. The diagnostic method
may take one of several forms: for example, a direct DNA test (i.e., genotyping or haplotyping one or
more of the polymorphic sites in the TNFRSF11B gene), a serological test, or a physical exam
measurement. The only requirement is that there be a good correlation between the diagnostic test results
and the underlying TNFRSF11B genotype or haplotype that is in turn correlated with the clinical
response. In a preferred embodiment, this diagnostic method uses the predictive haplotyping method
described above.

Any or all analytical and mathematical operations involved in practicing the methods of the
present invention may be implemented by a computer. In addition, the computer may execute a program
that generates views (or screens) displayed on a display device and with which the user can interact to
view and analyze large amounts of information relating to the TNFRSF11B gene and its genomic
variation, including chromosome location, gene structure, gene family, gene expression data,
polymorphism data, genetic sequence data, and clinical population data (e.g., data on
ethnogeographic origin, clinical responses, genotypes, and haplotypes for one or more populations). The
TNFRSF11B polymorphism data described herein may be stored as part of a relational database (e.g., an
instance of an Oracle database or a set of ASCII flat files). These polymorphism data may be stored on
the computer's hard drive or may, for example, be stored on a CD ROM or on one or more other storage
devices accessible by the computer. For example, the data may be stored on one or more databases in
communication with the computer via a network.
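
By way of illustration only, polymorphism data of the kind listed in Table 3 could be held in a relational table as sketched below. The example uses an in-memory SQLite database purely for convenience; the table name, column names, and helper function are hypothetical and are not part of the specification. The three rows shown are taken from Table 3.

```python
import sqlite3

# In-memory database used only for this sketch.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE polymorphic_site (
        ps_id            TEXT PRIMARY KEY,
        accession        TEXT NOT NULL,
        position         INTEGER NOT NULL,
        reference_allele TEXT NOT NULL,
        variant_allele   TEXT NOT NULL
    )
    """
)
rows = [
    ("PS1", "AB008821", 491, "G", "A"),
    ("PS2", "AB008821", 676, "G", "T"),
    ("PS10", "AB008822", 505, "C", "T"),
]
conn.executemany("INSERT INTO polymorphic_site VALUES (?, ?, ?, ?, ?)", rows)


def variant_at(connection, ps_id):
    """Return the variant allele recorded for a polymorphic site, or None."""
    row = connection.execute(
        "SELECT variant_allele FROM polymorphic_site WHERE ps_id = ?", (ps_id,)
    ).fetchone()
    return row[0] if row else None


if __name__ == "__main__":
    print(variant_at(conn, "PS2"))
```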

Preferred embodiments of the invention are described in the following examples. Other
embodiments within the scope of the claims herein will be apparent to one skilled in the art from
consideration of the specification or practice of the invention as disclosed herein. It is intended that the
specification, together with the examples, be considered exemplary only, with the scope and spirit of the
invention being indicated by the claims which follow the examples.

EXAMPLES

The Examples herein are meant to exemplify the various aspects of carrying out the invention and
are not intended to limit the scope of the invention in any way. The Examples do not include detailed
descriptions for conventional methods employed, such as in the performance of genomic DNA isolation,
PCR and sequencing procedures. Such methods are well-known to those skilled in the art and are
described in numerous publications, for example, Sambrook, Fritsch, and Maniatis, "Molecular Cloning:
A Laboratory Manual", 2nd Edition, Cold Spring Harbor Laboratory Press, USA, (1989).

Example 1A

This example illustrates examination of various regions of the TNFRSF11B gene for
polymorphic sites using DNA from Index Repository IA.
Amplification of Target Regions

The following target regions of the TNFRSF11B gene were amplified using the PCR primer pairs
listed below, with the sequences presented in the 5' to 3' direction and nucleotide positions shown for
each region corresponding to the indicated GenBank Accession No.

Accession Number: AB008821
Fragment 1
Forward Primer
599-618 GTACGGCGGAAACTCACAGC (SEQ ID NO:199)
Reverse Primer
Complement of 1190-1168 GCACAGCAACTTGTTCATTGTGG (SEQ ID NO:200)
PCR product 592 nt

Fragment 2
Forward Primer
864-883 TCTCCCAGGGGACAGACACC (SEQ ID NO:201)
Reverse Primer
Complement of 1389-1367 TAAAAGGACACCCTAGGGGAAGC (SEQ ID NO:202)
PCR product 526 nt

Fragment 3
Forward Primer
949-970 ATCCTGGAGCTTTCTGCACACC (SEQ ID NO:203)
Reverse Primer
Complement of 1609-1584 CTGAATCTAAGGGACCACTTCTTTGC (SEQ ID NO:204)
PCR product 661 nt

Accession Number: AB008822
Fragment 4
Forward Primer
95-118 GCTAAGATGATGCCACTGTGTTCC (SEQ ID NO:205)
Reverse Primer
Complement of 699-677 GCCCTGTAGTGGCAAAGTATTCC (SEQ ID NO:206)
PCR product 605 nt

Fragment 5
Forward Primer
4239-4262 CCTTTCCTCTCACATTTCATGAGC (SEQ ID NO:207)
Reverse Primer
Complement of 4976-4955 CAAACTTGACACTGCCCTTTGC (SEQ ID NO:208)
PCR product 738 nt

Fragment 6
Forward Primer
6526-6550 ACACGTAAGCTCAGTTGGTCTCTGC (SEQ ID NO:209)
Reverse Primer
Complement of 7144-7123 GAGCAGGGGTGAGAACAAAACC (SEQ ID NO:210)
PCR product 619 nt

Fragment 7
Forward Primer
8433-8455 CTTCCAAAGTTTTGTTGGATGCC (SEQ ID NO:211)
Reverse Primer
Complement of 9119-9096 AGTTTACTCATCCATGGGATCTCG (SEQ ID NO:212)
PCR product 687 nt
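
The stated PCR product sizes follow from the primer coordinates: each product spans from the first position of the forward primer to the first position given for the reverse primer (the highest coordinate of its complement), inclusive. The following sketch checks this for the seven fragments of Example 1A; the dictionary layout and function name are hypothetical, while the coordinates and sizes are those listed above.

```python
# (forward primer start, reverse primer outer coordinate, stated product size in nt)
FRAGMENTS_1A = {
    "Fragment 1": (599, 1190, 592),
    "Fragment 2": (864, 1389, 526),
    "Fragment 3": (949, 1609, 661),
    "Fragment 4": (95, 699, 605),
    "Fragment 5": (4239, 4976, 738),
    "Fragment 6": (6526, 7144, 619),
    "Fragment 7": (8433, 9119, 687),
}


def product_length(forward_start, reverse_outer):
    """Inclusive length of the amplified region between the primer outer ends."""
    return reverse_outer - forward_start + 1


if __name__ == "__main__":
    for name, (start, end, stated) in FRAGMENTS_1A.items():
        print(name, product_length(start, end), stated)
```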

These primer pairs were used in PCR reactions containing genomic DNA isolated from
immortalized cell lines for each member of Index Repository IA. The PCR reactions were carried out
under the following conditions:

Reaction volume                                          = 20 µl
10 x Advantage 2 Polymerase reaction buffer (Clontech)   = 2 µl
100 ng of human genomic DNA                              = 1 µl
10 mM dNTP                                               = 0.4 µl
Advantage 2 Polymerase enzyme mix (Clontech)             = 0.2 µl
Forward Primer (10 µM)                                   = 0.4 µl
Reverse Primer (10 µM)                                   = 0.4 µl
Water                                                    = 15.6 µl

Amplification profile:

94°C - 2 min.      1 cycle

94°C - 30 sec.
70°C - 45 sec.     10 cycles
72°C - 1 min.

94°C - 30 sec.
64°C - 45 sec.     35 cycles
72°C - 1 min.

Sequencing of PCR Products

The PCR products were purified by Solid Phase Reversible Immobilization using the protocol
developed by the Whitehead Genome Center. A detailed protocol can be found at
http://www.genome.wi.mit.edu/sequencing/protocols/pure/SPRI_pcr.html.

Briefly, five µl of carboxyl coated magnetic beads (10 mg/ml) and 60 µl of HYB BUFFER (2.5M
NaCl/20% PEG 8000) were added to each PCR reaction mixture (20 µl). The reaction mixture was
mixed well and incubated at room temperature (RT) for 10 min. The microtitre plate was placed on a
magnet for 2 min and the beads washed twice with 150 µl of 70% EtOH. The beads were air dried for 2
min and the DNA was eluted in 25 µl of distilled water and incubated at RT for 5 min. The beads were
magnetically separated and the supernatant removed for testing and sequencing.

The purified PCR products were sequenced in both directions using the primer sets described
previously or those listed, in the 5' to 3' direction, below.

Accession Number: AB008821
Fragment 1
Forward Primer
624-643 CCCAGCGAGAGGACAAAGGT (SEQ ID NO:213)
Reverse Primer
Complement of 1162-1144 GGAAACCTCAGGGGCTTGG (SEQ ID NO:214)

Fragment 2
Forward Primer
930-950 CCCAGCCCTGAAAGCGTTAAT (SEQ ID NO:215)
Reverse Primer
Complement of 1353-1334 AAAGCGGTTTCCTGCTCCAG (SEQ ID NO:216)

Fragment 3
Forward Primer
983-1002 CCGCCCAAGCTTCCTAAAAA (SEQ ID NO:217)
Reverse Primer
Complement of 1457-1436 TCTCCCTCTCTCTCGCTGTCTG (SEQ ID NO:218)


Accession Number: AB008822
Fragment 4
Forward Primer
151-170 AGTGGACCACCCAGGAAACG (SEQ ID NO:219)
Reverse Primer
Complement of 671-652 GCAATGGTCCTTTGGAAGCA (SEQ ID NO:220)

Fragment 5
Forward Primer
4314-4333 AACTGGCAAAGGGGATGATG (SEQ ID NO:221)
Reverse Primer
Complement of 4843-4822 GCATCGAGAGTAGCCTTAGCTG (SEQ ID NO:222)

Fragment 6
Forward Primer
6558-6577 ACCAGCCAACAGAAGCTTGA (SEQ ID NO:223)
Reverse Primer
Complement of 7069-7048 GTCCAACAATGATTCCAACAGG (SEQ ID NO:224)

Fragment 7
Forward Primer
8520-8541 GGTGTCACTTAACTCCCTCTCA (SEQ ID NO:225)
Reverse Primer
Complement of 9039-9020 CTGATTGGACCTGGTTACCT (SEQ ID NO:226)

Analysis of Sequences for Polymorphic Sites

Sequences were analyzed for the presence of polymorphisms using the Polyphred program
(Nickerson et al., Nucleic Acids Res. 25(14):2745-2751, 1997). The presence of a polymorphism was
confirmed on both strands. The polymorphisms and their locations in the TNFRSF11B gene are listed in
Table 3 below.

Example 1B

This example illustrates examination of the OCIF gene for polymorphic sites in the following
target regions: 2000 base pairs upstream of the ATG start codon; each of the exons, including
approximately 100 base pairs on either side of the exon; and 500 to 1000 base pairs downstream of the
termination codon.

Amplification of Target Regions

PCR primer pairs, which were designed based on the nearly complete OCIF genomic sequence
reported in the GenBank database (Accession Nos. AB008821 and AB008822), are set forth below:
Accession No. AB008821
Promoter fragment 1
Forward primer:
69-94 5'- CTGTAAACAATTTCAGTGGCAACCCG -3' (SEQ ID NO:227)
Reverse primer:
427-402 5'- CCGTGCTATTCTGCATTCACTCCTTG -3' (SEQ ID NO:228)
PCR product: 359 nt

Promoter fragment 2
Forward primer:
263-288 5'- CGTAGGAAGCTCCGATACCAATAGCC -3' (SEQ ID NO:229)
Reverse primer:
868-846 5'- GGAGAGCAGGGGAAAAAAAAGCC -3' (SEQ ID NO:230)
PCR product: 606 nt

Promoter fragment 3
Forward primer:
558-583 5'- TTGAGGTTTCAGAACCCGAAGTGAAG -3' (SEQ ID NO:231)
Reverse primer:
1069-1045 5'- CAACAGGAAGTATCGCCTGCCTTTG -3' (SEQ ID NO:232)
PCR product: 512 nt

Exon 1
Forward primer:
945-970 5'- GTTAATCCTGGAGCTTTCTGCACACC -3' (SEQ ID NO:233)
Reverse primer:
1445-1420 5'- TCGCTGTCTGTCTCTCTCTTGCTGTC -3' (SEQ ID NO:234)
PCR product: 501 nt

Accession No. AB008822
Exon 2
Forward primer:
91-114 5'- TCATGCTAAGATGATGCCACTGTG -3' (SEQ ID NO:235)
Reverse primer:
686-665 5'- GCAAAGTATTCCTCTGAGCAATGG -3' (SEQ ID NO:236)
PCR product: 598 nt

Exon 3
Forward primer:
4435-4457 5'- CTGTGTTAAGAGGGCATCTGCTG -3' (SEQ ID NO:237)
Reverse primer:
4791-4770 5'- TTGACCAAGAATGTGGCTGGAG -3' (SEQ ID NO:238)
PCR product: 357 nt

Exon 4
Forward primer:
6549-6573 5'- GCCACTAAGACCAGCCAACAGAAGC -3' (SEQ ID NO:239)
Reverse primer:
7166-7120 5'- GAGCAGGGGTGAGAACAAAACCTTG -3' (SEQ ID NO:240)
PCR product: 596 nt

Exon 5
Forward primer:
8150-8175 5'- GGCTGTGTGTCTCCTTTAGTTCCTCG -3' (SEQ ID NO:241)
Reverse primer:
9263-9238 5'- CTGACAGTTGGATTAACCATTTGGGG -3' (SEQ ID NO:242)
PCR product: 1114 nt

These primer pairs were used in PCR reactions containing genomic DNA isolated from
immortalized cell lines for a reference population of 70 human individuals. The PCR reactions were
carried out under the following conditions:

Reaction volume                                          = 50 µl
10 x Advantage 2 Polymerase reaction buffer (Clontech)   = 5 µl
100 ng of human genomic DNA                              = 5 µl
10 mM dNTP                                               = 1 µl
Advantage 2 Polymerase enzyme mix (Clontech)             = 0.5 µl
Forward Primer (10 µM)                                   = 1 µl
Reverse Primer (10 µM)                                   = 1 µl
Water                                                    = 36.5 µl

Amplification profile:

94°C - 2 min.      1 cycle

94°C - 30 sec.
70°C - 45 sec.     10 cycles
72°C - 1 min.

94°C - 30 sec.
64°C - 45 sec.     35 cycles
72°C - 1 min.

Sequencing of PCR Products

The PCR products were purified by Solid Phase Reversible Immobilization using the protocol
developed by the Whitehead Genome Center. A detailed protocol can be found at
http://www.genome.wi.mit.edu/sequencing/protocols/pure/SPRI_pcr.html.

Briefly, carboxyl coated magnetic beads (10 mg/ml) were washed three times with wash buffer
(0.5 M EDTA, pH 8.0). Ten µl of washed beads and 50 µl of HYB BUFFER (2.5M NaCl/20% PEG
8000) were added to each PCR reaction mixture (50 µl). The reaction mixture was mixed well and
incubated at RT for 10 min. The microtitre plate was placed on a magnet for 2 min and the beads washed
twice with 150 µl of 70% EtOH. The beads were air dried for 2 min, resuspended in 20 µl of elution
buffer (10 mM Tris-acetate, pH 7.8) and incubated at RT for 5 min. The beads were magnetically
separated and the supernatant removed for testing and sequencing.

The purified PCR products were sequenced in both directions using sequencing primers that were
identical to the PCR primers, except for the exon 5 amplification product which, due to its size, was
sequenced using two sets of primers. The Exon 5 sequencing primer sets are set forth below:

Accession No. AB008822
Forward primer 8150-8175
5'- GGCTGTGTGTCTCCTTTAGTTCCTCG -3' (SEQ ID NO:241)
Reverse primer 8928-8903



WO 01/04137 



PCT/US00/18803 



5'- TGGGAAAGTGGTACGTCTTTGAGTGC -3' (SEQ ID NO:243)
Forward primer 8674-8697
5'- TTGACCTCTGTGAAAACAGCGTGC -3' (SEQ ID NO:244)
Reverse primer 9145-9120
5'- CTGAAAGCCTCAAGTGCCTGAGAAAC -3' (SEQ ID NO:245)

Sequencing reactions were performed using the Big-Dye terminator kit from PE Biosystems (Foster City,
CA) according to the manufacturer's instructions. The sequencing products were analyzed on an ABI 477
automated sequencer (PE Biosystems, Foster City, CA).



Analysis of Sequences for Polymorphic Sites

Sequences were analyzed for the presence of polymorphisms using the Polyphred program
(Nickerson et al., Nucleic Acids Res. 25(14):2745-2751, 1997). The presence of a polymorphism was
confirmed on both strands. The polymorphisms and their locations in the OCIF gene are listed in
Table 3 below.



Table 3. Polymorphic Sites Identified in the TNFRSF11B Gene

Polymorphic    Nucleotide Position        Reference   Variant   Example
Site Number                               Allele      Allele
PS1            491 (Acc. #AB008821)       G           A         1B
PS2            676 (Acc. #AB008821)       G           T         1A
PS3            889 (Acc. #AB008821)       C           T         1A
PS4            916 (Acc. #AB008821)       G           T         1A
PS5            950 (Acc. #AB008821)       T           C         1A
PS6 (R)        1181 (Acc. #AB008821)      G           C         1A, 1B
PS7            1217 (Acc. #AB008821)      C           T         1A, 1B
PS8            1294 (Acc. #AB008821)      G           A         1A, 1B
PS9            1390 (Acc. #AB008821)      C           A         1A
PS10           505 (Acc. #AB008822)       C           T         1A
PS11           668 (Acc. #AB008822)       T           C         1A
PS12           4397 (Acc. #AB008822)      T           C         1A
PS13           4501 (Acc. #AB008822)      C           T         1A
PS14           6601 (Acc. #AB008822)      G           A         1A
PS15           6824 (Acc. #AB008822)      C           T         1A
PS16           6839 (Acc. #AB008822)      G           A         1A, 1B
PS17           6845 (Acc. #AB008822)      A           G         1A
PS18           6893 (Acc. #AB008822)      A           G         1A, 1B
PS19           6950 (Acc. #AB008822)      A           C         1B
PS20           8258 (Acc. #AB008822)      G           A         1B
PS21           8391 (Acc. #AB008822)      T           C         1B
PS22           8622 (Acc. #AB008822)      G           T         1B
PS23           8694 (Acc. #AB008822)      G           A         1A
PS24           8955 (Acc. #AB008822)      A           T         1B
PS25           9003 (Acc. #AB008822)      T           C         1A, 1B

(R) Previously reported in the literature.

Example 2

This example illustrates analysis of the TNFRSF11B polymorphisms identified in the Index
Repositories for human genotypes and haplotypes for all polymorphic sites except PS1 and PS16.

The different genotypes containing these polymorphisms that were observed in these reference
populations are shown in Table 4 below, with the haplotype pair indicating the combination of haplotypes
determined for the individual using the haplotype derivation protocol described below. In Table 4,
homozygous positions are indicated by one nucleotide and heterozygous positions are indicated by two
nucleotides.
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The haplotype pairs shown in Table 4 were estimated from the unphased genotypes using an 
extension of Clark's algorithm (Clark, A.G. (1990) Mol Bio Evol 7, 1 1 1-122), as described in U.S. 
Provisional Patent Application filed April 19, 2000 and entitled "A Method and System for Determining 
5 Haplotypes from a Collection of Polymorphisms". In this method, haplotypes are assigned directly from 
individuals who are homozygous at all sites or heterozygous at no more than one of the variable sites. 
This list of haplotypes is then used to deconvolute the unphased genotypes in the remaining (multiply 
heterozygous) individuals. 
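
The derivation steps described above can be sketched in code. The following Python is a minimal illustration of basic Clark-style inference only, not the extension described in the cited provisional application. The genotype encoding (one string per site, one letter if homozygous and two letters if heterozygous, following the Table 4 convention) and all function names are assumptions made for this sketch.

```python
def het_sites(genotype):
    """Indices of heterozygous sites (two-letter entries such as 'CT')."""
    return [i for i, g in enumerate(genotype) if len(g) == 2]

def seed_haplotypes(genotypes):
    """Haplotypes read directly from individuals with at most one heterozygous site."""
    known = set()
    for gt in genotypes:
        if len(het_sites(gt)) <= 1:
            known.add("".join(g[0] for g in gt))
            known.add("".join(g[-1] for g in gt))
    return known

def partner_of(genotype, hap):
    """Return the complementary haplotype if hap fits the genotype, else None."""
    partner = []
    for g, h in zip(genotype, hap):
        if h not in g:
            return None
        partner.append(g.replace(h, "", 1) if len(g) == 2 else g)
    return "".join(partner)

def clark_resolve(genotypes):
    """Resolve multiply heterozygous genotypes against the list of known haplotypes."""
    known = seed_haplotypes(genotypes)
    unresolved = [gt for gt in genotypes if len(het_sites(gt)) > 1]
    resolved = {}
    progress = True
    while unresolved and progress:
        progress = False
        for gt in list(unresolved):
            # Known haplotypes are tried in sorted order so the result is repeatable.
            for hap in sorted(known):
                partner = partner_of(gt, hap)
                if partner is not None:
                    resolved[gt] = (hap, partner)
                    known.add(partner)  # a newly inferred haplotype may resolve others
                    unresolved.remove(gt)
                    progress = True
                    break
    return known, resolved, unresolved
```

The outcome of this kind of procedure can depend on the order in which individuals and known haplotypes are tried, and genotypes that match no known haplotype remain unresolved.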

By following this protocol, it was determined that the Index Repositories examined herein and,
by extension, the general population contains the 27 human TNFRSF11B haplotypes shown in Table 5
below.

In view of the above, it will be seen that the several advantages of the invention are achieved and
other advantageous results attained.

As various changes could be made in the above methods and compositions without departing from
the scope of the invention, it is intended that all matter contained in the above description and shown in
the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.

All references cited in this specification, including patents and patent applications, are hereby 
incorporated in their entirety by reference. The discussion of references herein is intended merely to 
summarize the assertions made by their authors and no admission is made that any reference constitutes 
prior art. Applicants reserve the right to challenge the accuracy and pertinency of the cited references.

What is Claimed is:

1. An isolated polynucleotide comprising a nucleotide sequence selected from the group consisting of:

(a) a first nucleotide sequence which is a polymorphic variant of a reference sequence for
Osteoclastogenesis Inhibitory Factor (TNFRSF11B) gene or a fragment thereof, wherein the
reference sequence comprises SEQ ID NOS:1-2, and the polymorphic variant comprises at least
one polymorphism selected from the group consisting of adenine at PS1, thymine at PS2, thymine
at PS3, thymine at PS4, cytosine at PS5, thymine at PS7, adenine at PS8, adenine at PS9, thymine
at PS10, cytosine at PS11, cytosine at PS12, thymine at PS13, adenine at PS14, thymine at PS15,
adenine at PS16, guanine at PS17, guanine at PS18, cytosine at PS19, adenine at PS20, cytosine at
PS21, thymine at PS22, adenine at PS23, thymine at PS24 and cytosine at PS25; and

(b) a second nucleotide sequence which is complementary to the first nucleotide sequence. 

2. The isolated polynucleotide of claim 1 which comprises a TNFRSF11B isogene.

3. The isolated polynucleotide of claim 1 which is a DNA molecule and comprises both the first and
second nucleotide sequences and further comprises expression regulatory elements operably linked
to the first nucleotide sequence.

4. A recombinant organism transformed or transfected with the isolated polynucleotide of claim 1,
wherein the organism expresses a TNFRSF11B protein encoded by the first nucleotide sequence.

5. The recombinant organism of claim 4 which is a nonhuman transgenic animal.

6. The isolated polynucleotide of claim 1, wherein the first nucleotide sequence is a polymorphic variant
of a fragment of the TNFRSF11B gene, the fragment comprising one or more polymorphisms
selected from the group consisting of adenine at PS1, thymine at PS2, thymine at PS3, thymine at
PS4, cytosine at PS5, thymine at PS7, adenine at PS8, adenine at PS9, thymine at PS10, cytosine at
PS11, cytosine at PS12, thymine at PS13, adenine at PS14, thymine at PS15, adenine at PS16,
guanine at PS17, guanine at PS18, cytosine at PS19, adenine at PS20, cytosine at PS21, thymine at
PS22, adenine at PS23, thymine at PS24 and cytosine at PS25.

7. An isolated polynucleotide comprising a nucleotide sequence which is a polymorphic variant of a
reference sequence for the TNFRSF11B cDNA or a fragment thereof, wherein the reference
sequence comprises SEQ ID NO:3 and the polymorphic variant comprises at least one polymorphism
selected from the group consisting of thymine at a position corresponding to nucleotide 699, adenine
at a position corresponding to nucleotide 714, guanine at a position corresponding to nucleotide 720,
guanine at a position corresponding to nucleotide 768, adenine at a position corresponding to
nucleotide 841, thymine at a position corresponding to nucleotide 1102 and cytosine at a position
corresponding to nucleotide 1150.

8. A recombinant organism transformed or transfected with the isolated polynucleotide of claim 7,
wherein the organism expresses an Osteoclastogenesis Inhibitory Factor (TNFRSF11B) protein
encoded by the polymorphic variant sequence.

9. The recombinant organism of claim 8 which is a nonhuman transgenic animal.

10. An isolated polypeptide comprising an amino acid sequence which is a polymorphic variant of a
reference sequence for the TNFRSF11B protein or a fragment thereof, wherein the reference
sequence comprises SEQ ID NO:4 and the polymorphic variant comprises one or more variant
amino acids selected from the group consisting of methionine at a position corresponding to amino
acid position 240, methionine at a position corresponding to amino acid position 281, and serine at a
position corresponding to amino acid 368.
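
The correspondence between the cDNA positions recited in claim 7 and the amino acid positions recited in claim 10 follows from codon arithmetic. The short sketch below assumes that nucleotide 1 of the reference cDNA is the first base of the ATG start codon, which is consistent with the 1206-nucleotide cDNA and the 401-residue protein shown in the figures; the helper name `codon_of` is hypothetical.

```python
def codon_of(nt_pos):
    """Map a 1-based coding-sequence position to (codon number, position within codon)."""
    if nt_pos < 1:
        raise ValueError("positions are 1-based")
    return (nt_pos - 1) // 3 + 1, (nt_pos - 1) % 3 + 1

# cDNA positions whose variants change the encoded amino acid
for pos in (720, 841, 1102):
    codon, offset = codon_of(pos)
    print(f"nucleotide {pos}: codon {codon}, base {offset} of 3")
```

Under that assumption, nucleotides 720, 841 and 1102 fall in codons 240, 281 and 368, the three amino acid positions named in claim 10.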

11. An isolated antibody specific for and immunoreactive with the isolated polypeptide of claim 10.

12. A method for screening for drugs targeting the isolated polypeptide of claim 10 which comprises
contacting the TNFRSF11B polymorphic variant with a candidate agent and assaying for binding
activity.

13. A composition comprising at least one genotyping oligonucleotide for detecting a polymorphism in
the Osteoclastogenesis Inhibitory Factor (TNFRSF11B) gene at a polymorphic site selected from
PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15, PS16, PS17, PS18,
PS19, PS20, PS21, PS22, PS23, PS24, and PS25.

14. The composition of claim 13, wherein the genotyping oligonucleotide is an allele-specific
oligonucleotide that specifically hybridizes to an allele of the TNFRSF11B gene at a region
containing the polymorphic site.

15. The composition of claim 14, wherein the allele-specific oligonucleotide comprises a nucleotide
sequence selected from the group consisting of SEQ ID NOS:5-54, the complements of SEQ ID
NOS:5-54, and SEQ ID NOS:55-150.

16. The composition of claim 13, wherein the genotyping oligonucleotide is a primer-extension 
oligonucleotide. 

17. A method for genotyping the Osteoclastogenesis Inhibitory Factor (TNFRSF11B) gene of an
individual, comprising determining for the two copies of the TNFRSF11B gene present in the
individual the identity of the nucleotide pair at one or more polymorphic sites (PS) selected from
PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15, PS16, PS17, PS18,
PS19, PS20, PS21, PS22, PS23, PS24, and PS25.

18. The method of claim 17, wherein the determining step comprises:

(a) isolating from the individual a nucleic acid mixture comprising both copies of the TNFRSF11B
gene, or a fragment thereof, that are present in the individual;

(b) amplifying from the nucleic acid mixture a target region containing at least one of the
polymorphic sites;

(c) hybridizing a primer extension oligonucleotide to one allele of the amplified target region;

(d) performing a nucleic acid template-dependent, primer extension reaction on the hybridized
genotyping oligonucleotide in the presence of at least two different terminators of the reaction,
wherein said terminators are complementary to the alternative nucleotides present at the
polymorphic site; and

(e) detecting the presence and identity of the terminator in the extended genotyping
oligonucleotide.

19. A method for haplotyping the Osteoclastogenesis Inhibitory Factor (TNFRSF11B) gene of an
individual which comprises determining, for one copy of the TNFRSF11B gene present in the
individual, the identity of the nucleotide at one or more polymorphic sites (PS) selected from PS1,
PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15, PS16, PS17, PS18,
PS19, PS20, PS21, PS22, PS23, PS24, and PS25.

20. The method of claim 19, wherein the determining step comprises:

(a) isolating from the individual a nucleic acid molecule containing only one of the two copies of
the TNFRSF11B gene, or a fragment thereof, that is present in the individual;

(b) amplifying from the nucleic acid molecule a target region containing at least one of the 
polymorphic sites; 

(c) hybridizing a primer extension oligonucleotide to one allele of the amplified target region; 

(d) performing a nucleic acid template-dependent, primer extension reaction on the hybridized 
genotyping oligonucleotide in the presence of at least two different terminators of the reaction, 
wherein said terminators are complementary to the alternative nucleotides present at the 
polymorphic site; and 

(e) detecting the presence and identity of the terminator in the extended genotyping 
oligonucleotide. 

21. A method for predicting a haplotype pair for the Osteoclastogenesis Inhibitory Factor (TNFRSF11B)
gene of an individual comprising:

(a) identifying a TNFRSF11B genotype for the individual at two or more polymorphic sites
selected from PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9, PS10, PS11, PS12, PS13, PS14, PS15,
PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24, and PS25;

(b) enumerating all possible haplotype pairs which are consistent with the genotype;

(c) accessing data containing the TNFRSF11B haplotype pairs determined in a reference
population; and

(d) assigning a haplotype pair to the individual that is consistent with the data. 
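
Steps (b) through (d) of claim 21 can be illustrated with a short sketch. The Python below is only one possible reading: the genotype encoding (one letter per homozygous site, two per heterozygous site), the function names, and in particular the rule of choosing the consistent pair seen most often in the reference data are assumptions of this sketch, since the claim requires only that the assigned pair be consistent with the data.

```python
from itertools import product

def enumerate_pairs(genotype):
    """All unordered haplotype pairs consistent with an unphased genotype."""
    het = [i for i, g in enumerate(genotype) if len(g) == 2]
    pairs = set()
    for choice in product((0, 1), repeat=len(het)):
        pick = dict(zip(het, choice))
        # First haplotype takes the chosen allele at each heterozygous site.
        h1 = "".join(g[pick.get(i, 0)] for i, g in enumerate(genotype))
        # Second haplotype takes the other allele at those sites.
        h2 = "".join(g[1 - pick[i]] if i in pick else g
                     for i, g in enumerate(genotype))
        pairs.add(tuple(sorted((h1, h2))))
    return pairs

def assign_pair(genotype, reference_pair_counts):
    """Return the consistent pair most frequent in the reference data, or None.

    reference_pair_counts maps sorted (hap, hap) tuples to observed counts;
    this structure is hypothetical and stands in for the reference data.
    """
    candidates = [p for p in enumerate_pairs(genotype) if p in reference_pair_counts]
    if not candidates:
        return None
    # Ties are broken alphabetically so the result is deterministic.
    return max(candidates, key=lambda p: (reference_pair_counts[p], p))
```

A genotype with k heterozygous sites yields 2^(k-1) candidate pairs for k of at least 1, so the reference data is what makes the assignment tractable for multiply heterozygous individuals.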

22. A method for identifying an association between a trait and at least one genotype or haplotype of the
Osteoclastogenesis Inhibitory Factor gene which comprises comparing the frequency of the genotype
or haplotype in a population exhibiting the trait with the frequency of the genotype or haplotype in a
reference population, wherein the genotype or haplotype comprises a nucleotide pair or nucleotide
located at one or more polymorphic sites selected from PS1, PS2, PS3, PS4, PS5, PS7, PS8, PS9,
PS10, PS11, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS24,
and PS25, wherein a higher frequency of the genotype or haplotype in the trait population than in
the reference population indicates the trait is associated with the genotype or haplotype.
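
The frequency comparison recited in claim 22 can be sketched as follows. This Python is illustrative only: representing each individual as a tuple of the haplotype labels it carries, and the function names, are assumptions of this sketch. It performs the raw comparison described in the claim and does not include any test of statistical significance.

```python
def frequency(population, marker):
    """Fraction of individuals in the population that carry the given marker."""
    if not population:
        raise ValueError("empty population")
    return sum(1 for individual in population if marker in individual) / len(population)

def is_more_frequent_in_trait(trait_pop, reference_pop, marker):
    """True if the marker is more frequent in the trait population than in the reference."""
    return frequency(trait_pop, marker) > frequency(reference_pop, marker)

# Hypothetical haplotype pairs, labelled H1..H3 purely for illustration
trait_pop = [("H1", "H2"), ("H1", "H1"), ("H2", "H3")]
reference_pop = [("H2", "H3"), ("H1", "H3"), ("H3", "H3"), ("H2", "H2")]
print(is_more_frequent_in_trait(trait_pop, reference_pop, "H1"))
```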

23. The method of claim 22, wherein the haplotype is selected from haplotype numbers 1-27 shown in
Table 5.

24. The method of claim 23, wherein the trait is a clinical response to a drug targeting TNFRSF11B.

25. A computer system for storing and analyzing polymorphism data for the Osteoclastogenesis 
Inhibitory Factor gene, comprising: 

(a) a central processing unit (CPU); 

(b) a communication interface; 

(c) a display device; 

(d) an input device; and 

(e) a database containing the polymorphism data; 

wherein the polymorphism data comprises the genotypes and haplotype pairs shown in Table 4 and 
the haplotypes shown in Table 5. 

26. A genome anthology for the Osteoclastogenesis Inhibitory Factor (TNFRSF11B) gene which
comprises TNFRSF11B isogenes defined by haplotypes 1-27 shown in Table 5.

POLYMORPHISMS IN THE TNFRSF11B GENE (Accession No. AB008821)



CTGGAGACAT 
AGGGACTTTT 
GTAATCCATG 
GACCAGGGAA 
CTTCTGTCGA 
AGAAGTTCAG 
TGGGGTTGGT 
AGTCCAATTT 
ACAAGGAGTG 
GTTAGAAAAA 

TGCGAACTCC 
CTGTAATTTG 
ACGGCGGAAA 
ACACTCCAAC 

GAGACACAAG 
GGTCCCGGCT 
TCAGAGCCCC 
TTTTTTCCCC 

GCCCCACCTC 

CCTGGAGCTT 
AAAGAAAGGT 
CAGGCGATAC 
CGGGCTGCGG 
CCTGAGGTTT 

TGGTAAGTCC 

TGCCACCTGG 

CTCGCTCCCT 
TTTCAAGTTA 

TCCTGCTGAC 
AGGGAGAGAG 
TTCTGTCTTC 
TTGCTTAATG 
TAGATTCAGA 
AATTAGAATA 



ATAACTTGAA 
TCAGCCATCT 
AATGGGACCA 
TTAATGGGGG 
TAGCTTGAGG 
CGCGTAGGAA 
GAAGGGAACA 
TCACTCTGCA 
AATGCAGAAT 
TTCCTACTAC 

CCGAAAAGGG 
AGGTTTCAGA 
CTCACAGCTT 
TGCGTCCGGA 

CACAGCAGCT 
GCCAGGAGGC 
GCGGAGACAG 
TGCTCTCCCA 

CCTGGGGGAT 
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TCTGCACACC 
GCAAAGTTTG 
TTCCTGTTGC 
AGACGCACCG 
CCGGGGACCA 



CACTTGGCCC 
GTAAACAATT 
CACTTTACAA 
AGACAGCGAA 
CTAGTGGAAA 
GCTCCGATAC 
GTGCTCCGCA 
GATTCTCTCT 
AGCACGGGCT 
ATGGTTTATG 

CTCAGACAAT 
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TCGCCCAGCG 
TCTTGGCTGG 
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GCCCAGCGTG 
TGGCCGCTGG 
CAGCCGCCTT 
GGGGACAGAC 

CCTTTCCGCC 
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GTCCAGGATA 
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GAGCGCTCGC 
CAATGAACAA 
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AAACTTGTTT 
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GATTTTTCTA 



TGATGGGGAA 
TCAGTGGCAA 
GTCATCAAGT 
CCCTAGAGCA 
GACCTCGAGG 
CAATAGCCCT 
AGGTTATCCC 
GGCTCTAACT 
TTAGGGCCAA 
TAAACTTGAA 

GCCATGCATA 
AAGGGGTCAG 
AGAGGACAAA 
ATCGGACTCT 

TGCCCAGCCC 
CGGGAAGGGG 
GTTCCTCAGC 
ACCACCGCCC 
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CCAGCCCTGA 

TCCCGCCCAA 
GAAAAATGAC 
TATATAACGT 
CCAGCCGCCG 
GTTGCTGTGC 
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CCGGCGCCTG 

CCGGCGGGGA 

AGGCTGGAGC 
GTCCTTTTAC 
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AGAGACAGAC 
GAAAGTTTTA 
TCATGTAGTC 
GTGGCAAAGA 
AGTCTTCTTA 
A 



GCAGCTCTGC 
CCCGCGAACT 
CTAACTTCTA 
AAGTGCCAAA 
AGGCTACTCC 
TTGATGATGG 
TGCCCCAGGC 
ACCCCAGATA 
TCAGACATTA 
GATGAATGAT 
A 

AAGAGGGGCC 
GCAGCCGGGT 
GGTCTGGGAC 
CAGGGTGGAG 

TCCCACCGCT 
CCGGGAAACC 
CCGGTGGCTT 
CACCCCTCAC 

AAGCGTTAAT 
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GCTTCCTAAA 
TGATCAAAGG 
GATGAGCGCA 
CCTCCAAGCC 
TGCGCQCTCG 

GGGAGGCTGC 

GAAGGCTCCA 
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AGGAAACCGC 
GCTGCAAAGT 

AGCGAGAGAG 
GTCATTAACC 
CATACTATCT 
AGTGGTCCCT 
AAACCATTAT 
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AAGCTTACTT 
GAGTCAAATG 
ATGATGCCAC 
AG TGGACCAC 
GAAACCTCTC 
AAj^CAACAC 
^CqVCTACTA = 
AGCCCCGTGT 
CCACAACCGC 
TCTGCTTCAA 
I^TACGTCTC 
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AGTTGTGACA 
GCAAATTGCA 
GTGCTTCCAA 

ATTTAATGAC 
GATGGTTTTT 
ATAGTTGATC 
TGCAGCACTT 
AATTATGCTC 
AATTAATTAT 
TAAAAACAAG 
GGTAAGGACT 
CAAAACAGAT 
TTTCTCTTTT 
TCTTTTAAAC 
CAGTGCAGCC 
TACTTAGATG 
GCAGTGTTTC 
TCCTAGGCTG 
TTTGGCAGCA 
CCCCATGTGA 
TGGCAAAATC 
TAACTATTTT 
ACGACCAATA 
AAAATCTATT 
AGATGCAAAT 
TGAGGGAGGA 
GACTCAAAAG 
ATCCCATCAG 
TGTAGAAAAA 
AAAAATAATG 
TTTGTAGAAT 
CATTTTACTC 
CTCCCTAAAA 
GCCCAAAGAG 
AACCGTTTTG 
CATATTTTAT 
AGCATGGTGA 
TCACATTTTG 
AAGGTAAAAT 
TCAGTTATAG 
GCTTCTGAAT 
TGTTAAATAA 



TGTGCCAAAT 
ATACTGTTGC 
TGTGTTCCTT 

ccaggaaacg 
atcagctgtt' 
tgtacagcaa 



CTCATTAGGC 
ACATAAGAAC 
TCTCCTTCTA 
TTTCCTCCAA 
GTGTGACAAA 
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TTAAGGTAAT 
AAACCTATTT 
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AGTACCTTCA 
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ACAGGACTTT 
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ATCTCCATTA 
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AAATCTCAAA 
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TATACCTCTA 
TTTGACAAAC 
ATATTAATGA 
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CAGTAACCCA 
TATAACTTTT 
GAAGAAGTAA 
TCAACTGAAG 
GACTCCTTTT 
CCCCTGGACT 
CAGCCAAAAA 
ACTCCTGGTT 
AACTCTCAAA 
CTTTAGGTTT 
CTGTTTCTCA 
TGCATCAGAG 
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ATAAAAGCAA 
TGAAAAGTGG 
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CCTGGCAGGT 
CATTACTTAG 
AGGGAAGTCA 
ATATTATGAA 
CTTTAAAAGC 



TGGCACACCA_ 

^AGTACGTC 

GCAAGGAAGG 

TGCCCTCCTG 

AAAATTAATT 



gtgacgagtg 
^gcaggagt" 
gcgctacctt 
gatttggagt 



TCTAT ACTGC 
GC/^CGCAC 
GAGATAGAGT 
GGTGCAAGCT 



AGGATCATGC AAAGTCAGAT 



ACACTTTTGT TCTGATGACA TTATAGGATA 
AACCTGCCAG GTAGGTACTA TGTGTCTGGA 
CTCAGAGGAA TACTTTGCCA CTACAGGGCA 



TGCAGCAAAT TATTCTCTCA 
TTAAAGAAAC AAACTCAAGT 
TATTTCACTT CAGCATGGAC 
ATCAGAAATG TTAATTTATA 
GACTCTGGAG TGCTAACAAT 
AGAATGGTGA GGGGAATTGC 
CCTTTAGCAT GGGAGCTGAG 
CTCTTCAATG AGCTTATTCT 
AGAGCAAGCA CTTGCCTATA 
CAGCATTGGT CAGGGCTCAT 
CGTTTTTTTT CTGCCACATT 
CATAGCTTGA GAAAATTAAG 
TCAGTATAGA TTCTGATGAC 
CCCTGCTGAT ATTTTAAGAA 
TGTGGGCAGC TGTCCTGCGC 
CTAGCCACTA GATACCAATA 
TGTCTTCAGA CACTGTCAAA 
GAGAACAGGG TCATCAATGC 
ACTTGTGATA T AC AAAGT CT 
AAAGGCATAC AAATGAAACA 
AATAGTGAAT CTTATAAAAT 
TCCCTTAAAA TTCCTCTTCG 
GTTCCTACTT TCTATTGGAT 
TTGTGTGTGT GTCAGGGTGC 
AT CCATGTAA TTCATTCAGT 
GCTATGCAGC TTGGAAACTA 
GGATCTTTCT TAAATAAGTA 
GGCAGCCAGA AGACTCAGAA 
GGCAGCT^CAG TGGGATTTAT 
CGGTTCCTCT TGGGAAATAA 
TATGTGGTGT TACTCTAAAA 
TGCTGTTTTG AAAT CAGATT 
CTGTTAATTC CTGTGGAATT 
GTAAAGCCAA ATTTCTCCAT 
TATAATTTTT ATATTTCCAC 
ATGGATAGAT CTTTTTCATA 
TGTTCATGTT CAGGAAGGTC 
ACATTAGTTC TGTCATTCTT 
T AACTT AC CT AAAAGAAATA 
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TGAGATGCAT 
TGCACTATTG 
ACCTTCAAAC 
CCAAGAGAGT 
AAGCAGTTAT 
ATTTCATTAT 
TGTTTGGGAG 
TTATCTTAGA 
AACCAAGTGC 
GTGTATTGAA 
TGCGAAGCTT 
AGTATCCACT 
TCAGTTTGAA 
ATATCTGGAT 
ATTGTAGAAT 
GCAGTCCTTC 
TGTCGCCAGG 
TAAGTATCTG 
AAATTATTAG 
TTCAAAAATC 
TAATCACAGA 
TATGAGTATT 
GGTACTTTGA 
GGGGTGTGGA 
AAGTTGTATA 
GAGAATTTTG 
AGAAAATCTG 
CAAAAGTACA 
TTACCTCTCC 
GAGGTTTCCA 
AGTATTTAAT 
GTCTCCTCTC 
ACTTAGAGCA 
CATTATAATT 
TGATAGTAAT 
AAAAGTACCA 
ATTAGATAAA 
AGATTCTTTT 
TCTGACACAT
ATGAACTTCT CATTAGGATG CAGGAGAAGA CCCAAGCCAC AGATATGTAT
CTGAAGAATG AACAAGATTC TTAGGCCCGG CACGGTGGCT CACATCTGTA 
ATCTCAAGAG TTTGAGAGGT CAAGGCGGGC AGATCACCTG AGGTCAGGAG 
TTCAAGACCA GCCTGGCCAA CATGATGAAA CCCTGCCTCT ACTAAAAATA 
CAAAAATTAG CAGGGCATGG TGGTGCATGC CTGCAACCCT AGCTACTCAG 
GAGGCTGAGA CAGGAGAATC TCTTGAACCC TCGAGGCGGA GGTTGTGGTG 
AGCTGAGATC CCTCTACTGC ACTCCAGCCT GGGTGACAGA GATGAGACTC 
CGTCCCTGCC GCCGCCCCCG CCTTCCCCCC CAAAAAGATT CTTCTTCATG 
CAGAACATAC GGCAGTCAAC AAAGGGAGAC CTGGGTCCAG GTGTCCAAGT 
CACTTATTTC GAGTAAATTA GCAATGAAAG AATGCCATGG AATCCCTGCC 
CAAATACCTC TGCTTATGAT ATTGTAGAAT TTGATATAGA GTTGTATCCC 
ATTTAAGGAG TAGGATGTAG TAGGAAAGTA CTAAAAACAA ACACACAAAC 
AGAAAACCCT CTTTGCTTTG TAAGGTGGTT CCTAAGATAA TGTCAGTGCA 
ATGCTGGAAA TAATATTTAA TATGTGAAGG TTTTAGGCTG TGTTTTCCCC 
TCCTGTTCTT TTTTTCTGCC AGCCCTTTGT CATTTTTGCA GGTCAATGAA 
TCATGTAGAA AGAGACAGGA GATGAAACTA GAACCAGTCC ATTTTGCCCC 
TTTTTTTATT TTCTGGTTTT GGTAAAAGAT ACAATGAGGT AGGAGGTTGA 
GATTTATAAA TGAAGTTTAA TAAGTTTCTG TAGCTTTGAT TTTTCTCTTT
CATATTTGTT ATCTTGCATA AGCCAGAATT GGCCTGTAAA ATCTACATAT 
GGATATTGAA GTCTAAATCT GTTCAACTAG CTTACACTAG ATGGAGATAT 
TTTCATATTC AGATACACTG GAATGTATGA TCTAGCCATG CGTAATATAG 
TCAAGTGTTT GAAGGTATTT ATTTTTAATA GCGTCTTTAG TTGTGGACTG 
GTTCAAGTTT TTCTGCCAAT GATTTCTTCA AATTTATCAA ATATTTTTCC 
ATCATGAAGT AAAATGCCCT TGCAGTCACC CTTCCTGAAG TTTGAACGAC 
TCTGCTGTTT TAAACAGTTT AAGCAAATGG TATATCATCT TCCGTTTACT 
ATGTAGCTTA ACTGCAGGCT TACGCTTTTG AGTCAGCGGC CAACTTTATT 
GCCACCTTCA AAAGTTTATT ATAATGTTGT AAATTTTTAC TTCTCAAGGT 
TAGCATACTT AGGAGTTGCT TCACAATTAG GATTCAGGAA AGAAAGAACT 
TCAGTAGGAA CTGATTGGAA TTTAATGATG CAGCATTCAA TGGGTACTAA 
TTTCAAAGAA TGATATTACA GCAGACACAC AGCAGTTATC TTGATTTTCT 
AGGAATAATT GTATGAAGAA TATGGCTGAC AACACGGCCT TACTGCCACT 
CAGCGGAGGC TGGACTAATG AACACCCTAC CCTTCTTTCC TTTCCTCTCA 
CATTTCATGA GCGTTTTGTA GGTAACGAGA AAATTGACTT GCATTTGCAT 
TACAAGGAGG AGAAACTGGC AAAGGGGATG ATGGTGGAAG TTTTGTTCTG 
TCTAATGAAG TGAAAAATGA AAATGCTAGA GTTTTGTGCA ACATAATAGT 

C 

AGCAGTAAAA ACCAAGTGAA AAGTCTTTCC AAAACTGTGT TAAGAGGGCA 
TCTGCTGGGA AACGATTTGA GGAGAAGGTA CTAAATTGCT TGGTATTTTC
CGTAGGJ&CC 'ICAGAGCGAA " ATACAGTTTG CAAAAGATGT yC'CS§A TGGGij 

jTCTTCTCAAA ; TCAGACGTCA TCTAAAGCAC CGTGTAGAAA AC$GACAAAT 
[TGCAGTGTCTlCTGGTCTCCT GCTAACTCAG AAAGGAAATG- CAlSf^ACACGiT 
'CAACATATGT ^CCGGAAACA GTGAA TCAAC fCMAAATOT GG AAT AGGTA 
ATTACATTCC AAAATACGT C ^ TTTGTACGAT TTTGT AGTAT CATCTCT CTC 
TCTGAGTTGA ACACAAGGCC TCCAGCCACA TTCTTGGTCA AACTTACATT 
TTCCCTTTCT TGAATCTTAA CCAGCTAAGG CTACTCTCGA TGCATTACTG 
CTAAAGCTAC CACTCAGAAT CTCTCAAAAA CTCATCTTCT CACAGATAAC 
ACCTCAAAGC TTGATTTTCT CTCCTTTCAC ACTGAAATCA AATCTTGCCC 
ATAGGCAAAG GGCAGTGTCA AGTTTGCCAC TGAGATGAAA TTAGGAGAGT 
CCAAACTGTA GAATTCACGT TGTGTGTTAT TACTTTCACG AATGTCTGTA 
TTATTAACTA AAGTATATAT TGGCAACTAA GAAGCAAAGT GATATAAACA 
TGATGACAAA TTAGGCCAGG CATGGTGGCT TACTCCTATA ATCCCAACAT 
TTTGGGGGGC CAAGGTAGGC AGATCACTTG AGGTCAGGAT TTCAAGACCA 
GCCTGACCAA CATGGTGAAA CCTTGTCTCT ACTAAAAATA CAAAAATTAG 
CTGGGCATGG TAGCAGGCAC TTCTAGTACC AGCTACTCAG GGCTGAGGCA 
GGAGAATCGC TTGAACCCAG GAGATGGAGG TTGCAGTGAG CTGAGATTGT 
ACCACTGCAC TCCAGTCTGG GCAACAGAGC AAGATTTCAT CACACACACA
CACACACACA CACACACACA CATTAGAAAT GTGTACTTGG CTTTGTTACC
TATGGTATTA GTGCATCTAT TGCATGGAAC TTCCAAGCTA CTCTGGTTGT 5500 
GTTAAGCTCT TCATTGGGTA CAGGTCACTA GTATTAAGTT CAGGTTATTC 
GGATGCATTC CACGGTAGTG ATGACAATTC ATCAGGCTAG TGTGTGTGTT 5600 
CACCTTGTCA CTCCCACCAC TAGACTAATC TCAGACCTTC ACTCAAAGAC 
ACATTACACT AAAGATGATT TGCTTTTTTG TGTTTAATCA AGCAATGGTA 5700 
TAAACCAGCT TGACTCTCCC CAAACAGTTT TTCGTACTAC AAAGAAGTTT 
ATGAAGCAGA GAAATGTGAA TTGATATATA TATGAGATTC TAACCCAGTT 5800 
CCAGCATTGT TTCATTGTGT AATTGAAATC ATAGACAAGC CATTTTAGCC 
TTTGCTTTCT TATCTAAAAA AAAAAAAAAA AAAATGAAGG AAGGGGTATT 5900 
AAAAGGAGTG ATCAAATTTT AACATTCTCT TTAATTAATT CATTTTTAAT 
TTTACTTTTT TTCATTTATT GTGCACTTAC TATGTGGTAC TGTGCTATAG 6000

AGGCTTTAAC ATTTATAAAA ACACTGTGAA AGTTGCTTCA GATGAATATA 
GGTAGTAGAA CGGCAGAACT AGTATTCAAA GCCAGGTCTG ATGAATCCAA 6100 
AAACAAACAC CCATTACTCC CATTTTCTGG GACATACTTA CTCTACCCAG 
ATGCTCTGGG CTTTGTAATG CCTATGTAAA TAACATAGTT TTATGTTTGG 6200 
TTATTTTCCT ATGTAATGTC TACTTATATA TCTGTATCTA TCTCTTGCTT 
TGTTTCCAAA GGTAAACTAT GTGTCTAAAT GTGGGCAAAA AATAACACAC 6300 
TATTCCAAAT TACTGTTCAA ATTCCTTTAA GTCAGTGATA ATTATTTGTT 
TTGACATTAA TCATGAAGTT CCCTGTGGGT ACTAGGTAAA CCTTTAATAG 6400 
AATGTTAATG TTTGTATTCA TTATAAGAAT TTTTGGCTGT TACTTATTTA 
CAACAATATT TCACTCTAAT TAGACATTTA CTAAACTTTC TCTTGAAAAC 6500 
AATGCCCAAA AAAGAACATT AGAAGACACG TAAGCTCAGT TGGTCTCTGC 
CACTAAGACC AGCCAACAGA AGCTTGATTT TATTCAAACT TTGCATTTTA 6600 
GCATATTTTA TCTTGGAAAA TTCAATTGTG TTGGTTTTTT GTTTTTGTTT 
A 

GTATTGAATA GACTCTCAGA AATCCAATTG TTGAGTAAAT CTTCTGGGTT 6700
TTCTAACCTT TCTTTAGATG TTAggCggTG" TGAGGAGGgA TTCTTC^Gg 
jTTGCTGTTCC TAGAAAGTTT ACGCCTAACT GGCTTAGTGT CTTGGTAGACf j|§P.Q 
AATTTGCCTG -0GACCAAAGT AAAfiGCAGAG AG^CTAGAGA <^ATAa|aC^"' 

I • • . .;, - * ■ . 

GCAACACAGC TCACAAGAAC AGACTTTCCA GCTGCTGAAG TTA TGGAAA C $J>9J& 

M . ■ & '• Of 

ATCAAAACAA AGACCAAGAT ATAGTCAAGA A GATCATCGA AGGTATGATA 

C 

ATCTAAAATA AAAAGATCAA TCAGAAATCA AAGACACCTA TTTATCATAA 7000 
ACCAGGAACA AGACTGCATG TATGTTTAGT TGTGTGGATC TTGTTTCCCT 
GTTGGAATCA TTGTTGGACT GAAAAAGTTT CCACCTGATA ATGTAGATGT 7100 
GATTCCACAA ACAGTTATAC AAGGTTTTGT TCTCACCCCT GCTCCCCAGT 
TTCCTTGTAA AGTATGTTGA ACACTCTAAG AGAAGAGAAA TGCATTTGAA 7200 
GGCAGGGCTG TATCTCAGGG AGTCGCTTCC AGATCCCTTA ACGCTTCTGT 
AAGCAGCCCC TCTAGACCAC CAAGGAGAAG CTCTATAACC ACTTTGTATC 7300 
TTACATTGCA CCTCTACCAA GAAGCTCTGT TGTATTTACT TGGTAATTCT 
CTCCAGGTAG GCTTTTCGTA GCTTACAAAT ATGTTCTTAT TAATCCTCAT 7400 
GATATGGCCT GCATTAAAAT TATTTTAATG GCATATGTTA TGAGAATTAA 
TGAGATAAAA TCTGAAAAGT GTTTGAGCCT CTTGTAGGAA AAAGCTAGTT 7500

ACAGCAAAAT GTTCTCACAT CTTATAAGTT TATATAAAGA TTCTCCTTTA 
GAAATGGTGT GAGAGAGAAA CAGAGAGAGA TAGGGAGAGA AGTGTGAAAG 7600 
AATCTGAAGA AAAGGAGTTT CATCCAGTGT GGACTGTAAG CTTTACGACA 
CATGATGGAA AGAGTTCTGA CTTCAGTAAG CATTGGGAGG ACATGCTAGA 7700 
AGAAAAAGGA AGAAGAGTTT CCATAATGCA GACAGGGTCA GTGAGAAATT 
CATTCAGGTC CTCACCAGTA GTTAAATGAC TGTATAGTCT TGCACTACCC 7800 
TAAAAAACTT CAAGTATCTG AAACCGGGGC AACAGATTTT AGGAGACCAA
CGTCTTTGAG AGCTGATTGC TTTTGCTTAT GCAAAGAGTA AACTTTTATG 7900
TTTTGAGCAA ACCAAAAGTA TTCTTTGAAC GTATAATTAG CCCTGAAGCC
GAAAGAAAAG AGAAAATCAG AGACCGTTAG AATTGGAAGC AACCAAATTC 8000

CCTATTTTAT AAATGAGGAC ATTTTAACCC AGAAAGATGA ACCGATTTGG
CTTAGGGCTC ACAGATACTA AGTGACTCAT GTCATTAATA GAAATGTTAG 8100
TTCCTCCCTC TTAGGTTTGT ACCCTAGCTT ATTACTGAAA TATTCTCTAG 
GCTGTGTGTC TCCTTTAGTT CCTCGACCTC ATGTCTTTGA GTTTTCAGAT 8200 
ATCCTCCTCA TGGAGGTAGT CCTCTGGTGC TATGTGTATT CTTTAAAGGC 
TAGTTACGGC AATTAACTTA TCAACTAGCG CCTACTAATG AAACTTTGTA 8300 
A 

TTACAAAGTA GCTAACTTGA ATACTTTCCT TTTTTTCTGA AATGTTATGG 
TGGTAATTTC TCAAACTTTT TCTTAGAAAA CTGAGAGTGA TGTGTCTTAT 8400 

C 

TTTCTACTGT TAATTTTCAA AATTAGGAGC TTCTTCCAAA GTTTTGTTGG 
ATGCCAAAAA TATATAGCAT ATTATCTTAT TATAACAAAA AATATTTATC 8500 
TCAGTTCTTA GAAATAAATG GTGTCACTTA ACTCCCTCTC AAAAGAAAAG 
GTTATCATTG AAATATAATT ATGAAATTCT GCAAGAACCT TTTGCCTCAC 8600 
GCTTGTTTTA TGATGGCATT GGATGAATAT AAATGATGTG AACACTTATC 

T _ 

TGGGCTTTTG CTTTATGCAG 5ATATTGACCT CTGTGAAAAC AGCGTGCAGC " 870p| 

™ i --^ 7*™^ _ _ : -4 t f j j;7" ~ 

jSG'CACATTGG ACg^C^^^j^ACCTTCG AGCAGCTTCG TAGCTOGATG 
(SAAAGCTT AC CGGGAAA^A^^^GGAG^l C^G AC^TTC^ 8 80g] 

A^GGCATGC AAACCCAGTG AC^GATCCT GAAGC TGCTC '^f TTGTGGg 
GAATAAAAAA f^CGACCAA '.GA^CCTTgA AG^CCT^^ _ 890 5j 

J^G^CTGAA" /j[gA^TS51CA : ; CTTTCCGAAA ACTGTCAGTG! ISGAGTCTAAS^ 

iGAAGA CCATC. AGGT T CCTTC ACAGC TT CAC AATGTA CAAA TTGTATCAGA" "900Qj 

| >-T ' " Z 

AGTTATTTTT AGAAATGATA GGTAACCAGG T CCAATCAGT TiAAAATAAGQ 

! _£ " 

TGCTT ATAAl C TGGAAATGGC CATTGAGCTG TTTCCTCACA ATTGGCGAGA 9100 
TCCCATGGAT GAGTAAACTG TTTCTCAGGC ACTTGAGGCT TTCAGTGATA 
TCTTTCTCAT TACCAGTGAC TAATTTTGCC ACAGGGTACT AAAAGAAACT 9200 
ATGATGTGGA GAAAGGACTA ACATCTCCTC CAATAAACCC CAAATGGTTA 
ATCCAACTGT CAGATCTGGA TCGTTATCTA CTGACTATAT TTTCCCTTAT 9300 
TACTGCTTGC AGTAATTCAA CTGGAAATTA AAAAAAAAAA ACTAGACTCC 
ACTGGGCCTT ACTAAATATG GGAATGTCTA ACTTAAATAG CTTTGGGATT 9400 
CCAGCTATGC TAGAGGCTTT TATTAGAAAG CCATATTTTT TTCTGTAAAA 
GTTACTAATA TATCTGTAAC ACTATTACAG TATTGCTATT TATATTCATT 9500 
CAGATATAAG ATTTGGACAT ATTATCATCC TATAAAGAAA CGGTATGACT 
TAATTTTAGA AAGAAAATTA TATTCTGTTT ATTATGACAA ATGAAAGAGA 9600 
AAATATATAT TTTTAATGGA AAGTTTGTAG CATTTTTCTA ATAGGTACTG 
CCATATTTTT CTGTGTGGAG TATTTTTATA ATTTTATCTG TATAAGCTGT 9700 
AATATCATTT TATAGAAAAT GCATTATTTA GTCAATTGTT TAATGTTGGA 
AAACATATGA AATATAAATT ATCTGAATAT TAGATGCTCT GAGAAATTGA 9800 
ATGTACCTTA TTTAAAAGAT TTTATGGTTT TATAACTATA TAAATGACAT 
TATTAAAGTT TTCAAATTA 9869

ATGAACAAGT TGCTGTGCTG CGCGCTCGTG TTTCTGGACA TCTCCATTAA
        C
GTGGACCACC CAGGAAACGT TTCCTCCAAA GTACCTTCAT TATGACGAAG 100
AAACCTCTCA TCAGCTGTTG TGTGACAAAT GTCCTCCTGG TACCTACCTA
AAACAACACT GTACAGCAAA GTGGAAGACC GTGTGCGCCC CTTGCCCTGA 200
CCACTACTAC ACAGACAGCT GGCACACCAG TGACGAGTGT CTATACTGCA
GCCCCGTGTG CAAGGAGCTG CAGTACGTCA AGCAGGAGTG CAATCGCACC 300
CACAACCGCG TGTGCGAATG CAAGGAAGGG CGCTACCTTG AGATAGAGTT
CTGCTTGAAA CATAGGAGCT GCCCTCCTGG ATTTGGAGTG GTGCAAGCTG 400
GAACCCCAGA GCGAAATACA GTTTGCAAAA GATGTCCAGA TGGGTTCTTC
TCAAATGAGA CGTCATCTAA AGCACCCTGT AGAAAACACA CAAATTGCAG 500
TGTCTTTGGT CTCCTGCTAA CTCAGAAAGG AAATGCAACA CACGACAACA
TATGTTCCGG AAACAGTGAA TCAACTCAAA AATGTGGAAT AGATGTTACC 600
CTGTGTGAGG AGGCATTCTT CAGGTTTGCT GTTCCTACAA AGTTTACGCC
TAACTGGCTT AGTGTCTTGG TAGACAATTT GCCTGGCACC AAAGTAAACG 700
                                                    T
CAGAGAGTGT AGAGAGGATA AAACGGCAAC ACAGCTCACA AGAACAGACT
              A     G
TTCCAGCTGC TGAAGTTATG GAAACATCAA AACAAAGACC AAGATATAGT 800
                  G
CAAGAAGATC ATCCAAGATA TTGACCTCTG TGAAAACAGC GTGCAGCGGC
                                            A
ACATTGGACA TGCTAACCTC ACCTTCGAGC AGCTTCGTAG CTTGATGGAA 900
AGCTTACCGG GAAAGAAAGT GGGAGCAGAA GACATTGAAA AAACAATAAA
GGCATGCAAA CCCAGTGACC AGATCCTGAA GCTGCTCAGT TTGTGGCGAA 1000
TAAAAAATGG CGACCAAGAC ACCTTGAAGG GCCTAATGCA CGCACTAAAG
CACTCAAAGA CGTACCACTT TCCCAAAACT GTCACTCAGA GTCTAAAGAA 1100
GACCATCAGG TTCCTTCACA GCTTCACAAT GTACAAATTG TATCAGAAGT
 T                                                   C
TATTTTTAGA AATGATAGGT AACCAGGTCC AATCAGTAAA AATAAGCTGC 1200
TTATAA 1206

MNKLLCCALV FLDISIKWTT QETFPPKYLH YDEETSHQLL CDKCPPGTYL
  N
KQHCTAKWKT VCAPCPDHYY TDSWHTSDEC LYCSPVCKEL QYVKQECNRT 100
HNRVCECKEG RYLEIEFCLK HRSCPPGFGV VQAGTPERNT VCKRCPDGFF
SNETSSKAPC RKHTNCSVFG LLLTQKGNAT HDNICSGNSE STQKCGIDVT 200
LCEEAFFRFA VPTKFTPNWL SVLVDNLPGT KVNAESVERI KRQHSSQEQT
                                          M
FQLLKLWKHQ NKDQDIVKKI IQDIDLCENS VQRHIGHANL TFEQLRSLME 300
                                 M
SLPGKKVGAE DIEKTIKACK PSDQILKLLS LWRIKNGDQD TLKGLMHALK
HSKTYHFPKT VTQSLKKTIR FLHSFTMYKL YQKLFLEMIG NQVQSVKISC 400
                  S
L 401

SEQUENCE LISTING

<110> Genaissance Pharmaceuticals, Inc. 
Chew, Anne 
Denton, R. Rex 
Duda, Amy

Nandabalan, Krishnan 
Stevens, J. Claiborne 

<120> Drug Target Isogenes: Polymorphisms in the 
Osteoclastogenesis Inhibitory Factor 

<130> MWH-001PCT TNFRSF11B-OCIF 

<140> TBA 

<141> 2000-07-10 

<150> 60/143,020 
<151> 1999-07-09 

<160> 245 

<170> PatentIn Ver. 2.1

<210> 1 

<211> 1681 

<212> DNA 

<213> Homo sapiens 

<400> 1 

ctggagacat ataacttgaa cacttggccc tgatggggaa gcagctctgc agggactttt 60 
tcagccatct gtaaacaatt tcagtggcaa cccgcgaact gtaatccatg aatgggacca 120 
cactttacaa gtcatcaagt ctaacttcta gaccagggaa ttaatggggg agacagcgaa 180 
ccctagagca aagtgccaaa cttctgtcga tagcttgagg ctagtggaaa gacctcgagg 240
aggctactcc agaagttcag cgcgtaggaa gctccgatac caatagccct ttgatgatgg 300 
tggggttggt gaagggaaca gtgctccgca aggttatccc tgccccaggc agtccaattt 360 
tcactctgca gattctctct ggctctaact accccagata acaaggagtg aatgcagaat 420
agcacgggct ttagggccaa tcagacatta gttagaaaaa ttcctactac atggtttatg 480 
taaacttgaa gatgaatgat tgcgaactcc ccgaaaaggg ctcagacaat gccatgcata 540 
aagaggggcc ctgtaatttg aggtttcaga acccgaagtg aaggggtcag gcagccgggt 600 
acggcggaaa ctcacagctt tcgcccagcg agaggacaaa ggtctgggac acactccaac 660 
tgcgtccgga tcttggctgg atcggactct cagggtggag gagacacaag cacagcagct 720 
gcccagcgtg tgcccagccc tcccaccgct ggtcccggct gccaggaggc tggccgctgg 780 
cgggaagggg ccgggaaacc tcagagcccc gcggagacag cagccgcctt gttcctcagc 840 
ccggtggctt ttttttcccc tgctctccca ggggacagac accaccgccc cacccctcac 900 
gccccacctc cctgggggat cctttccgcc ccagccctga aagcgttaat cctggagctt 960 
tctgcacacc ccccgaccgc tcccgcccaa gcttcctaaa aaagaaaggt gcaaagtttg 1020
gtccaggata gaaaaatgac tgatcaaagg caggcgatac ttcctgttgc cgggacgcta 1080 



1 



WO 01/04137 



PCT/US00/18803 



tatataacgt gatgagcgca cgggctgcgg agacgcaccg gagcgctcgc ccagccgccg 1140

cctccaagcc cctgaggttt ccggggacca caatgaacaa gttgctgtgc tgcgcgctcg 1200 

tggtaagtcc ctgggccagc cgacgggtgc ccggcgcctg gggaggctgc tgccacctgg 1260 

tctcccaacc tcccagcgga ccggcgggga gaaggctcca ctcgctccct cccaggagag 1320 

gcttggggtt aggctggagc aggaaaccgc tttcaagtta tgccatgctt cccctagggt 1380 

gtccttttac gctgcaaagt tcctgctgac tttatggaag acagcaagag agagacagac 1440

agcgagagag agggagagag agagagagag aaacttgttt gaaagtttta gtcattaacc 1500 

ttctgtcttc atctcagaat attaacgccc tcatgtagtc catactatct ttgcttaatg 1560 

aacttgaact tttattatta gtggcaaaga agtggtccct tagattcaga gtaagttgga 1620

agaagacgtt agtcttctta aaaccattat aattagaata tgacatgata gatttttcta 1680 

a 1681 



<210> 2 

<211> 9869 

<212> DNA 

<213> Homo sapiens 

<400> 2 

aagcttactt tgtgccaaat ctcattaggc ttaaggtaat acaggacttt gagtcaaatg 60 

atactgttgc acataagaac aaacctattt tcatgctaag atgatgccac tgtgttcctt 120 

tctccttcta gtttctggac atctccatta agtggaccac ccaggaaacg tttcctccaa 180 

agtaccttca ttatgacgaa gaaacctctc atcagctgtt gtgtgacaaa tgtcctcctg 240

gtacctacct aaaacaacac tgtacagcaa agtggaagac cgtgtgcgcc ccttgccctg 300 

accactacta cacagacagc tggcacacca gtgacgagtg tctatactgc agccccgtgt 360 

gcaaggagct gcagtacgtc aagcaggagt gcaatcgcac ccacaaccgc gtgtgcgaat 420 

gcaaggaagg gcgctacctt gagatagagt tctgcttgaa acataggagc tgccctcctg 480 

gatttggagt ggtgcaagct ggtacgtgtc aatgtgcagc aaaattaatt aggatcatgc 540 

aaagtcagat agttgtgaca gtttaggaga acacttttgt tctgatgaca ttataggata 600 

gcaaattgca aaggtaatga aacctgccag gtaggtacta tgtgtctgga gtgcttccaa 660 

aggaccattg ctcagaggaa tactttgcca ctacagggca atttaatgac aaatctcaaa 720 

tgcagcaaat tattctctca tgagatgcat gatggttttt tttttttttt ttaaagaaac 780 

aaactcaagt tgcactattg atagttgatc tatacctcta tatttcactt cagcatggac 840 

accttcaaac tgcagcactt tttgacaaac atcagaaatg ttaatttata ccaagagagt 900 

aattatgctc atattaatga gactctggag tgctaacaat aagcagttat aattaattat 960 

gtaaaaaatg agaatggtga ggggaattgc atttcattat taaaaacaag gctagttctt 1020 

cctttagcat gggagctgag tgtttgggag ggtaaggact atagcagaat ctcttcaatg 1080 

agcttattct ttatcttaga caaaacagat tgtcaagcca agagcaagca cttgcctata 1140 

aaccaagtgc tttctctttt gcattttgaa cagcattggt cagggctcat gtgtattgaa 1200 

tcttttaaac cagtaaccca cgtttttttt ctgccacatt tgcgaagctt cagtgcagcc 1260 

tataactttt catagcttga gaaaattaag agtatccact tacttagatg gaagaagtaa 1320 

tcagtataga ttctgatgac tcagtttgaa gcagtgtttc tcaactgaag ccctgctgat 1380 

attttaagaa atatctggat tcctaggctg gactcctttt tgtgggcagc tgtcctgcgc 1440 

attgtagaat tttggcagca cccctggact ctagccacta gataccaata gcagtccttc 1500 

ccccatgtga cagccaaaaa tgtcttcaga cactgtcaaa tgtcgccagg tggcaaaatc 1560 

actcctggtt gagaacaggg tcatcaatgc taagtatctg taactatttt aactctcaaa 1620 

acttgtgata tacaaagtct aaattattag acgaccaata ctttaggttt aaaggcatac 1680 

aaatgaaaca ttcaaaaatc aaaatctatt ctgtttctca aatagtgaat cttataaaat 1740 
taatcacaga agatgcaaat tgcatcagag tcccttaaaa ttcctcttcg tatgagtatt 1800 
tgagggagga attggtgata gttcctactt tctattggat ggtactttga gactcaaaag 1860 
ctaagctaag ttgtgtgtgt gtcagggtgc ggggtgtgga atcccatcag ataaaagcaa 1920 
atccatgtaa ttcattcagt aagttgtata tgtagaaaaa tgaaaagtgg gctatgcagc 1980 
ttggaaacta gagaattttg aaaaataatg gaaatcacaa ggatctttct taaataagta 2040 
agaaaatctg tttgtagaat gaagcaagca ggcagccaga agactcagaa caaaagtaca 2100 
cattttactc tgtgtacact ggc^gcacag tgggatttat ttacctctcc ctccctaaaa 2160 
acccacacag cggttcctct tgggaaataa gaggtttcca gcccaaagag aaggaaagac 2220 
tatgtggtgt tactctaaaa agtatttaat aaccgttttg ttgttgctgt tgctgttttg 2280 
aaatcagatt gtctcctctc catattttat ttacttcatt ctgttaattc ctgtggaatt 2340 
acttagagca agcatggtga attctcaact gtaaagccaa atttctccat cattataatt 2400 
tcacattttg cctggcaggt tataattttt atatttccac tgatagtaat aaggtaaaat 2460 
cattacttag atggatagat ctttttcata aaaagtacca tcagttatag agggaagtca 2520 
tgttcatgtt caggaaggtc attagataaa gcttctgaat atattatgaa acattagttc 2580 
tgtcattctt agattctttt tgttaaataa ctttaaaagc taacttacct aaaagaaata 2640 
tctgacacat atgaacttct cattaggatg caggagaaga cccaagccac agatatgtat 2700 
ctgaagaatg aacaagattc ttaggcccgg cacggtggct cacatctgta atctcaagag 2760 
tttgagaggt caaggcgggc agatcacctg aggtcaggag ttcaagacca gcctggccaa 2820 
catgatgaaa ccctgcctct actaaaaata caaaaattag cagggcatgg tggtgcatgc 2880 
ctgcaaccct agctactcag gaggctgaga caggagaatc tcttgaaccc tcgaggcgga 2940 
ggttgtggtg agctgagatc cctctactgc actccagcct gggtgacaga gatgagactc 3000 
cgtccctgcc gccgcccccg ccttcccccc caaaaagatt cttcttcatg cagaacatac 3060 
ggcagtcaac aaagggagac ctgggtccag gtgtccaagt cacttatttc gagtaaatta 3120 
gcaatgaaag aatgccatgg aatccctgcc caaatacctc tgcttatgat attgtagaat 3180 
ttgatataga gttgtatccc atttaaggag taggatgtag taggaaagta ctaaaaacaa 3240 
acacacaaac agaaaaccct ctttgctttg taaggtggtt cctaagataa tgtcagtgca 3300 
atgctggaaa taatatttaa tatgtgaagg ttttaggctg tgttttcccc tcctgttctt 3360 
tttttctgcc agccctttgt catttttgca ggtcaatgaa tcatgtagaa agagacagga 3420 
gatgaaacta gaaccagtcc attttgcccc tttttttatt ttctggtttt ggtaaaagat 3480 
acaatgaggt aggaggttga gatttataaa tgaagtttaa taagtttctg tagctttgat 3540 
ttttctcttt catatttgtt atcttgpata agccagaatt ggcctgtaaa atctacatat 3600 
ggatattgaa gtctaaatct gttcaactag cttacactag atggagatat tttcatattc 3660 
agatacactg gaatgtatga tctagccatg cgtaatatag tcaagtgttt gaaggtattt 3720 
atttttaata gcgtctttag ttgtggactg gttcaagttt ttctgccaat gatttcttca 3780 
aatttatcaa atatttttcc atcatgaagt aaaatgccct tgcagtcacc cttcctgaag 3840 
tttgaacgac tctgctgttt taaacagttt aagcaaatgg tatatcatct tccgtttact 3900 
atgtagctta actgcaggct tacgcttttg agtcagcggc caactttatt gccaccttca 3960 
aaagtttatt ataatgttgt aaatttttac ttctcaaggt tagcatactt aggagttgct 4020 
tcacaattag gattcaggaa agaaagaact tcagtaggaa ctgattggaa tttaatgatg 4080 
cagcattcaa tgggtactaa tttcaaagaa tgatattaca gcagacacac agcagttatc 4140 
ttgattttct aggaataatt gtatgaagaa tatggctgac aacacggcct tactgccact 4200 
cagcggaggc tggactaatg aacaccctac ccttctttcc tttcctctca catttcatga 4260 
gcgttttgta ggtaacgaga aaattgactt gcatttgcat tacaaggagg agaaactggc 4320 
aaaggggatg atggtggaag ttttgttctg tctaatgaag tgaaaaatga aaatgctaga 4380 
gttttgtgca acataatagt agcagtaaaa accaagtgaa aagtctttcc aaaactgtgt 4440 
taagagggca tctgctggga aacgatttga ggagaaggta ctaaattgct tggtattttc 4500 
cgtaggaacc ccagagcgaa atacagtttg caaaagatgt ccagatgggt tcttctcaaa 4560 
tgagacgtca tctaaagcac cctgtagaaa acacacaaat tgcagtgtct ttggtctcct 4620 
gctaactcag aaaggaaatg caacacacga caacatatgt tccggaaaca gtgaatcaac 4680 
tcaaaaatgt ggaataggta attacattcc aaaatacgtc tttgtacgat tttgtagtat 4740 
catctctctc tctgagttga acacaaggcc tccagccaca ttcttggtca aacttacatt 4800 
ttccctttct tgaatcttaa ccagctaagg ctactctcga tgcattactg ctaaagctac 4860 
cactcagaat ctctcaaaaa ctcatcttct cacagataac acctcaaagc ttgattttct 4920 
ctcctttcac actgaaatca aatcttgccc ataggcaaag ggcagtgtca agtttgccac 4980 
tgagatgaaa ttaggagagt ccaaactgta gaattcacgt tgtgtgttat tactttcacg 5040 
aatgtctgta ttattaacta aagtatatat tggcaactaa gaagcaaagt gatataaaca 5100 
tgatgacaaa ttaggccagg catggtggct tactcctata atcccaacat tttggggggc 5160 
caaggtaggc agatcacttg aggtcaggat ttcaagacca gcctgaccaa catggtgaaa 5220 
ccttgtctct actaaaaata caaaaattag ctgggcatgg tagcaggcac ttctagtacc 5280 
agctactcag ggctgaggca ggagaatcgc ttgaacccag gagatggagg ttgcagtgag 5340 
ctgagattgt accactgcac tccagtctgg gcaacagagc aagatttcat cacacacaca 5400 
cacacacaca cacacacaca cattagaaat gtgtacttgg ctttgttacc tatggtatta 5460 
gtgcatctat tgcatggaac ttccaagcta ctctggttgt gttaagctct tcattgggta 5520 
caggtcacta gtattaagtt caggttattc ggatgcattc cacggtagtg atgacaattc 5580 
atcaggctag tgtgtgtgtt caccttgtca ctcccaccac tagactaatc tcagaccttc 5640 
actcaaagac acattacact aaagatgatt tgcttttttg tgtttaatca agcaatggta 5700 
taaaccagct tgactctccc caaacagttt ttcgtactac aaagaagttt atgaagcaga 5760 
gaaatgtgaa ttgatatata tatgagattc taacccagtt ccagcattgt ttcattgtgt 5820 
aattgaaatc atagacaagc cattttagcc tttgctttct tatctaaaaa aaaaaaaaaa 5880 
aaaatgaagg aaggggtatt aaaaggagtg atcaaatttt aacattctct ttaattaatt 5940 
catttttaat tttacttttt ttcatttatt gtgcacttac tatgtggtac tgtgctatag 6000 
aggctttaac atttataaaa acactgtgaa agttgcttca gatgaatata ggtagtagaa 6060 
cggcagaact agtattcaaa gccaggtctg atgaatccaa aaacaaacac ccattactcc 6120 
cattttctgg gacatactta ctctacccag atgctctggg ctttgtaatg cctatgtaaa 6180 
taacatagtt ttatgtttgg ttattttcct atgtaatgtc tacttatata tctgtatcta 6240 
tctcttgctt tgtttccaaa ggtaaactat gtgtctaaat gtgggcaaaa aataacacac 6300 
tattccaaat tactgttcaa attcctttaa gtcagtgata attatttgtt ttgacattaa 6360 
tcatgaagtt ccctgtgggt actaggtaaa cctttaatag aatgttaatg tttgtattca 6420 
ttataagaat ttttggctgt tacttattta caacaatatt tcactctaat tagacattta 6480 
ctaaactttc tcttgaaaac aatgcccaaa aaagaacatt agaagacacg taagctcagt 6540 
tggtctctgc cactaagacc agccaacaga agcttgattt tattcaaact ttgcatttta 6600 
gcatatttta tcttggaaaa ttcaattgtg ttggtttttt gtttttgttt gtattgaata 6660 
gactctcaga aatccaattg ttgagtaaat cttctgggtt ttctaacctt tctttagatg 6720 
ttaccctgtg tgaggaggca ttcttcaggt ttgctgttcc tacaaagttt acgcctaact 6780 
ggcttagtgt cttggtagac aatttgcctg gcaccaaagt aaacgcagag agtgtagaga 6840 
ggataaaacg gcaacacagc tcacaagaac agactttcca gctgctgaag ttatggaaac 6900 
atcaaaacaa agaccaagat atagtcaaga agatcatcca aggtatgata atctaaaata 6960 
aaaagatcaa tcagaaatca aagacaccta tttatcataa accaggaaca agactgcatg 7020 
tatgtttagt tgtgtggatc ttgtttccct gttggaatca ttgttggact gaaaaagttt 7080 
ccacctgata atgtagatgt gattccacaa acagttatac aaggttttgt tctcacccct 7140 
gctccccagt ttccttgtaa agtatgttga acactctaag agaagagaaa tgcatttgaa 7200 
ggcagggctg tatctcaggg agtcgcttcc agatccctta acgcttctgt aagcagcccc 7260 
tctagaccac caaggagaag ctctataacc actttgtatc ttacattgca cctctaccaa 7320 
gaagctctgt tgtatttact tggtaattct ctccaggtag gcttttcgta gcttacaaat 7380 
atgttcttat taatcctcat gatatggcct gcattaaaat tattttaatg gcatatgtta 7440 
tgagaattaa tgagataaaa tctgaaaagt gtttgagcct cttgtaggaa aaagctagtt 7500 
acagcaaaat gttctcacat cttataagtt tatataaaga ttctccttta gaaatggtgt 7560 
gagagagaaa cagagagaga tagggagaga agtgtgaaag aatctgaaga aaaggagttt 7620 
catccagtgt ggactgtaag ctttacgaca catgatggaa agagttctga cttcagtaag 7680 
cattgggagg acatgctaga agaaaaagga agaagagttt ccataatgca gacagggtca 7740 
gtgagaaatt cattcaggtc ctcaccagta gttaaatgac tgtatagtct tgcactaccc 7800 
taaaaaactt caagtatctg aaaccggggc aacagatttt aggagaccaa cgtctttgag 7860 
agctgattgc ttttgcttat gcaaagagta aacttttatg ttttgagcaa accaaaagta 7920 
ttctttgaac gtataattag ccctgaagcc gaaagaaaag agaaaatcag agaccgttag 7980 
aattggaagc aaccaaattc cctattttat aaatgaggac attttaaccc agaaagatga 8040 
accgatttgg cttagggctc acagatacta agtgactcat gtcattaata gaaatgttag 8100 
ttcctccctc ttaggtttgt accctagctt attactgaaa tattctctag gctgtgtgtc 8160 
tcctttagtt cctcgacctc atgtctttga gttttcagat atcctcctca tggaggtagt 8220 
cctctggtgc tatgtgtatt ctttaaaggc tagttacggc aattaactta tcaactagcg 8280 
cctactaatg aaactttgta ttacaaagta gctaacttga atactttcct ttttttctga 8340 
aatgttatgg tggtaatttc tcaaactttt tcttagaaaa ctgagagtga tgtgtcttat 8400 
tttctactgt taattttcaa aattaggagc ttcttccaaa gttttgttgg atgccaaaaa 8460 
tatatagcat attatcttat tataacaaaa aatatttatc tcagttctta gaaataaatg 8520 
gtgtcactta actccctctc aaaagaaaag gttatcattg aaatataatt atgaaattct 8580 
gcaagaacct tttgcctcac gcttgtttta tgatggcatt ggatgaatat aaatgatgtg 8640 
aacacttatc tgggcttttg ctttatgcag atattgacct ctgtgaaaac agcgtgcagc 8700 
ggcacattgg acatgctaac ctcaccttcg agcagcttcg tagcttgatg gaaagcttac 8760 
cgggaaagaa agtgggagca gaagacattg aaaaaacaat aaaggcatgc aaacccagtg 8820 
accagatcct gaagctgctc agtttgtggc gaataaaaaa tggcgaccaa gacaccttga 8880 
agggcctaat gcacgcacta aagcactcaa agacgtacca ctttcccaaa actgtcactc 8940 
agagtctaaa gaagaccatc aggttccttc acagcttcac aatgtacaaa ttgtatcaga 9000 
agttattttt agaaatgata ggtaaccagg tccaatcagt aaaaataagc tgcttataac 9060 
tggaaatggc cattgagctg tttcctcaca attggcgaga tcccatggat gagtaaactg 9120 
tttctcaggc acttgaggct ttcagtgata tctttctcat taccagtgac taattttgcc 9180 
acagggtact aaaagaaact atgatgtgga gaaaggacta acatctcctc caataaaccc 9240 
caaatggtta atccaactgt cagatctgga tcgttatcta ctgactatat tttcccttat 9300 
tactgcttgc agtaattcaa ctggaaatta aaaaaaaaaa actagactcc actgggcctt 9360 
actaaatatg ggaatgtcta acttaaatag ctttgggatt ccagctatgc tagaggcttt 9420 
tattagaaag ccatattttt ttctgtaaaa gttactaata tatctgtaac actattacag 9480 
tattgctatt tatattcatt cagatataag atttggacat attatcatcc tataaagaaa 9540 
cggtatgact taattttaga aagaaaatta tattctgttt attatgacaa atgaaagaga 9600 
aaatatatat ttttaatgga aagtttgtag catttttcta ataggtactg ccatattttt 9660 
ctgtgtggag tatttttata attttatctg tataagctgt aatatcattt tatagaaaat 9720 
gcattattta gtcaattgtt taatgttgga aaacatatga aatataaatt atctgaatat 9780 
tagatgctct gagaaattga atgtacctta tttaaaagat tttatggttt tataactata 9840 
taaatgacat tattaaagtt ttcaaatta 9869 

<210> 3 

<211> 1206 

<212> DNA 

<213> Homo sapiens 

<400> 3 
atgaacaagt tgctgtgctg cgcgctcgtg tttctggaca tctccattaa gtggaccacc 60 
caggaaacgt ttcctccaaa gtaccttcat tatgacgaag aaacctctca tcagctgttg 120 
tgtgacaaat gtcctcctgg tacctaccta aaacaacact gtacagcaaa gtggaagacc 180 
gtgtgcgccc cttgccctga ccactactac acagacagct ggcacaccag tgacgagtgt 240 
ctatactgca gccccgtgtg caaggagctg cagtacgtca agcaggagtg caatcgcacc 300 
cacaaccgcg tgtgcgaatg caaggaaggg cgctaccttg agatagagtt ctgcttgaaa 360 
cataggagct gccctcctgg atttggagtg gtgcaagctg gaaccccaga gcgaaataca 420 
gtttgcaaaa gatgtccaga tgggttcttc tcaaatgaga cgtcatctaa agcaccctgt 480 
agaaaacaca caaattgcag tgtctttggt ctcctgctaa ctcagaaagg aaatgcaaca 540 
cacgacaaca tatgttccgg aaacagtgaa tcaactcaaa aatgtggaat agatgttacc 600 
ctgtgtgagg aggcattctt caggtttgct gttcctacaa agtttacgcc taactggctt 660 
agtgtcttgg tagacaattt gcctggcacc aaagtaaacg cagagagtgt agagaggata 720 
aaacggcaac acagctcaca agaacagact ttccagctgc tgaagttatg gaaacatcaa 780 
aacaaagacc aagatatagt caagaagatc atccaagata ttgacctctg tgaaaacagc 840 
gtgcagcggc acattggaca tgctaacctc accttcgagc agcttcgtag cttgatggaa 900 
agcttaccgg gaaagaaagt gggagcagaa gacattgaaa aaacaataaa ggcatgcaaa 960 
cccagtgacc agatcctgaa gctgctcagt ttgtggcgaa taaaaaatgg cgaccaagac 1020 
accttgaagg gcctaatgca cgcactaaag cactcaaaga cgtaccactt tcccaaaact 1080 
gtcactcaga gtctaaagaa gaccatcagg ttccttcaca gcttcacaat gtacaaattg 1140 
tatcagaagt tatttttaga aatgataggt aaccaggtcc aatcagtaaa aataagctgc 1200 
ttataa 1206 



<210> 4 
<211> 401 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Asn Lys Leu Leu Cys Cys Ala Leu Val Phe Leu Asp Ile Ser Ile 
1 5 10 15 

Lys Trp Thr Thr Gln Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

Glu Glu Thr Ser His Gln Leu Leu Cys Asp Lys Cys Pro Pro Gly Thr 
35 40 45 

Tyr Leu Lys Gln His Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro 
50 55 60 

Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 

Leu Tyr Cys Ser Pro Val Cys Lys Glu Leu Gln Tyr Val Lys Gln Glu 
85 90 95 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr 
100 105 110 

Leu Glu Ile Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe 
115 120 125 

Gly Val Val Gln Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 140 

Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 

Arg Lys His Thr Asn Cys Ser Val Phe Gly Leu Leu Leu Thr Gln Lys 
165 170 175 

Gly Asn Ala Thr His Asp Asn Ile Cys Ser Gly Asn Ser Glu Ser Thr 
180 185 190 

Gln Lys Cys Gly Ile Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 

Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg Ile 
225 230 235 240 

Lys Arg Gln His Ser Ser Gln Glu Gln Thr Phe Gln Leu Leu Lys Leu 
245 250 255 

Trp Lys His Gln Asn Lys Asp Gln Asp Ile Val Lys Lys Ile Ile Gln 
260 265 270 

Asp Ile Asp Leu Cys Glu Asn Ser Val Gln Arg His Ile Gly His Ala 
275 280 285 

Asn Leu Thr Phe Glu Gln Leu Arg Ser Leu Met Glu Ser Leu Pro Gly 
290 295 300 

Lys Lys Val Gly Ala Glu Asp Ile Glu Lys Thr Ile Lys Ala Cys Lys 
305 310 315 320 

Pro Ser Asp Gln Ile Leu Lys Leu Leu Ser Leu Trp Arg Ile Lys Asn 
325 330 335 

Gly Asp Gln Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser 
340 345 350 

Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gln Ser Leu Lys Lys Thr 
355 360 365 

Ile Arg Phe Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gln Lys Leu 
370 375 380 

Phe Leu Glu Met Ile Gly Asn Gln Val Gln Ser Val Lys Ile Ser Cys 
385 390 395 400 

Leu 



<210> 5 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 5 

acttgaagat gaatg 15 



<210> 6 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 6 

acttgaaaat gaatg 15 

<210> 7 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 7 

gatcttggct ggatc 15 



<210> 8 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 8 

gatcttgtct ggatc 15 
<210> 9 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ccaccgcccc acccc 15 

<210> 10 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 10 

ccaccgctcc acccc 15 

<210> 11 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 11 
tccctggggg atcct 15 



<210> 12 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 12 
tccctggtgg atcct 15 



<210> 13 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gcgttaatcc tggag 15 

<210> 14 
<211> 15 
<212> DNA 
<213> Homo sapiens 
<400> 14 

gcgttaaccc tggag 15 



<210> 15 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cctgggccag ccgac 15 



<210> 16 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 16 

cctgggctag ccgac 15 



<210> 17 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 17 

gggagaaggc tccac 15 

<210> 18 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 18 

gggagaaagc tccac 15 



<210> 19 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 19 

ccttttacgc tgcaa 15 

WO 01/04137 PCT/US00/18803 



<210> 20 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 20 

ccttttaagc tgcaa 15 



<210> 21 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 21 

gctggtacgt gtcaa 15 



<210> 22 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 22 

gctggtatgt gtcaa 15 



<210> 23 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 23 

aggaccattg ctcag 15 



<210> 24 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 24 

aggaccactg ctcag 15 






<210> 25 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 25 

aacataatag tagca 15 

<210> 26 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 26 

aacataacag tagca 15 



<210> 27 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 27 

tattttccgt aggaa 15 



<210> 28 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 28 

tattttctgt aggaa 15 



<210> 29 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cattttagca tattt 15 



<210> 30 
<211> 15 
<212> DNA 



<213> Homo sapiens 

<400> 30 

cattttaaca tattt 15 



<210> 31 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 31 

aagtaaacgc agaga 15 

<210> 32 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 32 

aagtaaatgc agaga 15 

<210> 33 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 33 

gtgtagagag gataa 15 

<210> 34 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 34 

gtgtagaaag gataa 15 



<210> 35 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 35 

agaggataaa acggc 15 



<210> 36 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 36 

agaggatgaa acggc 15 

<210> 37 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 37 

tgaagttatg gaaac 15 

<210> 38 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 38 

tgaagttgtg gaaac 15 

<210> 39 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 39 

tgaagttatg gaaac 15 



<210> 40 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 40 

tgaagttgtg gaaac 15 






<210> 41 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gtatgataat ctaaa 15 



<210> 42 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 42 

gtatgatcat ctaaa 15 



<210> 43 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 43 

tagttacggc aatta 15 



<210> 44 
<211> 15 
<212> DNA 
<213> Homo sapiens 

<400> 44 

tagttacagc aatta 15 



<210> 45 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 45 

agagtgatgt gtctt 15 



<210> 46 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 46 

agagtgacgt gtctt 15 



<210> 47 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 47 

ggcattggat gaatat 16 

<210> 48 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 48 

ggcattgtat gaatat 16 

<210> 49 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 49 

aaacagcgtg cagcg 15 

<210> 50 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 50 

aaacagcatg cagcg 15 



<210> 51 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 51 

aaagaagacc atcag 15 



<210> 52 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 52 

aaagaagtcc atcag 15 



<210> 53 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 53 

tcagaagtta ttttt 15 



<210> 54 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 54 

tcagaagcta ttttt 15 



<210> 55 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 55 

atgtaaactt gaaga 15 



<210> 56 
<211> 17 
<212> DNA 
<213> Homo sapiens 

<400> 56 

ttcgcaatca ttcatct 17 






<210> 57 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 57 

atgtaaactt gaaaa 15 



<210> 58 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 58 

ttcgcaatca ttcattt 17 



<210> 59 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 59 

cgtccggatc ttggc 15 



<210> 60 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 60 

gagtccgatc cagcc 15 



<210> 61 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 61 

cgtccggatc ttgtc 15 



<210> 62 
<211> 15 
<212> DNA 






<213> Homo sapiens 
<400> 62 

gagtccgatc cagac 15 



<210> 63 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 63 

cagacaccac cgccc 15 



<210> 64 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 64 

gcgtgagggg tgggg 15 



<210> 65 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 65 

cagacaccac cgctc 15 



<210> 66 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 66 

gcgtgagggg tggag 15 



<210> 67 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 67 

cccacctccc tgggg 15 



<210> 68 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 68 

gcggaaagga tcccc 15 



<210> 69 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 69 

cccacctccc tggtg 15 



<210> 70 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 70 

gcggaaagga tccac 15 



<210> 71 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 71 

ctgaaagcgt taatc 15 



<210> 72 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 72 

agaaagctcc aggat 15 






<210> 73 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 73 

ctgaaagcgt taacc 15 



<210> 74 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 74 

agaaagctcc agggt 15 



<210> 75 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 75 

taagtccctg ggcca 15 



<210> 76 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 76 

gcacccgtcg gctgg 15 



<210> 77 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 77 

taagtccctg ggcta 15 



<210> 78 
<211> 15 
<212> DNA 






<213> Homo sapiens 
<400> 78 

gcacccgtcg gctag 15 



<210> 79 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 79 

ccggcgggga gaagg 15 



<210> 80 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 80 

gagcgagtgg agcct 15 



<210> 81 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 81 

ccggcgggga gaaag 15 



<210> 82 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 82 

gagcgagtgg agctt 15 



<210> 83 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 83 

gggtgtcctt ttacg 15 



<210> 84 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 84 

ggaactttgc agcgt 15 



<210> 85 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 85 

gggtgtcctt ttaag 15 



<210> 86 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 86 

ggaactttgc agctt 15 



<210> 87 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 87 

gtgcaagctg gtacg 15 



<210> 88 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 88 

tgcacattga cacgt 15 




<210> 89 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 89 

gtgcaagctg gtatg 15 



<210> 90 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 90 

tgcacattga cacat 15 



<210> 91 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 91 

ttccaaagga ccatt 15 



<210> 92 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 92 

attcctctga gcaat 15 



<210> 93 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 93 

ttccaaagga ccact 15 



<210> 94 
<211> 15 
<212> DNA 



<213> Homo sapiens 

<400> 94 

attcctctga gcagt 15 

<210> 95 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 95 

ttgtgcaaca taata 15 



<210> 96 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 96 

ttttactgct actat 15 



<210> 97 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 97 

ttgtgcaaca taaca 15 



<210> 98 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 98 

ttttactgct actgt 15 



<210> 99 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 99 

gcttggtatt ttccg 15 



<210> 100 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 100 

ctggggttcc tacgg 15 



<210> 101 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 101 

gcttggtatt ttctg 15 



<210> 102 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 102 

ctggggttcc tacag 15 



<210> 103 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 103 

actttgcatt ttagc 15 



<210> 104 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 104 

aagataaaat atgct 15 




<210> 105 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 105 

actttgcatt ttaac 15 



<210> 106 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 106 

aagataaaat atgtt 15 



<210> 107 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 107 

gcaccaaagt aaacg 15 



<210> 108 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 108 

ctacactctc tgcgt 15 



<210> 109 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 109 

gcaccaaagt aaatg 15 



<210> 110 
<211> 15 
<212> DNA 



<213> Homo sapiens 

<400> 110 

ctacactctc tgcat 15 



<210> 111 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 111 

cagagagtgt agaga 15 

<210> 112 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 112 

cgcgttttat cct 13 

<210> 113 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 113 

cagagagtgt agaaa 15 

<210> 114 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 114 

cgcgttttat ctt 13 

<210> 115 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 115 

gtgtagagag gataa 15 



<210> 116 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 116 

tgtgttgccg tttta 15 



<210> 117 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 117 

gtgtagagag gatga 15 



<210> 118 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 118 

tgtgttgccg tttca 15 



<210> 119 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 119 

agctgctgaa gttat 15 



<210> 120 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 120 

tttgatgttt ccata 15 






<210> 121 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 121 

agctgctgaa gttgt 15 



<210> 122 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 122 

tttgatgttt ccaca 15 



<210> 123 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 123 

tccaaggtat gataa 15 



<210> 124 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 124 

ttttatttta gatta 15 



<210> 125 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 125 

tccaaggtat gatca 15 



<210> 126 
<211> 15 
<212> DNA 




<213> Homo sapiens 

<400> 126 

ttttatttta gatga 15 



<210> 127 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 127 

aaaggctagt tacgg 15 



<210> 128 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 128 

tgataagtta attgc 15 



<210> 129 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 129 

aaaggctagt tacag 15 



<210> 130 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 130 

tgataagtta atttc 15 



<210> 131 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 131 

aaactgagag tgatg 15 

<210> 132 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 132 

gaaaataaga cacat 15 

<210> 133 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 133 

aaactgagag tgacg 15 

<210> 134 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 134 

gaaaataaga cacgt 15 

<210> 135 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 135 

tatgatggca ttgga 15 

<210> 136 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 136 

catttatatt catcc 15 



<210> 137 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 137 

tatgatggca ttgta 15 



<210> 138 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 138 

catttatatt catac 15 



<210> 139 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 139 

ctgtgaaaac agcgt 15 



<210> 140 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 140 

atgtgccgct gcacg 15 



<210> 141 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 141 

ctgtgaaaac agcat 15 



<210> 142 
<211> 15 
<212> DNA 



<213> Homo sapiens 

<400> 142 

atgtgccgct gcatg 15 



<210> 143 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 143 

gagtctaaag aagac 15 

<210> 144 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 144 

aggaacctga tggtc 15 

<210> 145 
<211> 15 
<212> DNA 
<213> Homo sapiens 

<400> 145 

gagtctaaag aagtc 15 

<210> 146 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 146 

aggaacctga tggac 15 

<210> 147 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 147 

attgtatcag aagtt 15 



<210> 148 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 148 

atttctaaaa ataac 15 



<210> 149 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 149 

attgtatcag aagct 15 



<210> 150 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 150 

atttctaaaa atagc 15 



<210> 151 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 151 

taaacttgaa 10 



<210> 152 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 152 

aatcattcat 10 






<210> 153 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 153 

ccggatcttg 10 



<210> 154 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 154 

tccgatccag 10 



<210> 155 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 155 

acaccaccgc 10 



<210> 156 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 156 

tgaggggtgg 10 



<210> 157 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 157 

acctccctgg 10 



<210> 158 
<211> 10 
<212> DNA 






<213> Homo sapiens 
<400> 158 

gaaaggatcc 10 



<210> 159 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 159 

aaagcgttaa 10 



<210> 160 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 160 

aagctccagg 10 



<210> 161 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 161 

gtccctgggc 10 



<210> 162 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 162 

cccgtcggct 10 



<210> 163 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 163 

gcggggagaa 10 



<210> 164 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 164 

cgagtggagc 10 



<210> 165 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 165 

tgtcctttta 10 



<210> 166 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 166 

actttgcagc 10 

<210> 167 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 167 

caagctggta 10 

<210> 168 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 168 

acattgacac 10 






<210> 169 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 169 

caaaggacca 10 



<210> 170 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 170 

cctctgagca 10 



<210> 171 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 171 

tgcaacataa 10 



<210> 172 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 172 

tactgctact 10 



<210> 173 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 173 

tggtattttc 10 



<210> 174 
<211> 10 
<212> DNA 






<213> Homo sapiens 
<400> 174 

gggttcctac 10 



<210> 175 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 175 

ttgcatttta 10 



<210> 176 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 176 

ataaaatatg 10 



<210> 177 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 177 

ccaaagtaaa 10 



<210> 178 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 178 

cactctctgc 10 



<210> 179 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 179 
agagtgtaga 10



<210> 180

<211> 10

<212> DNA

<213> Homo sapiens

<400> 180

gttttatcct 10



<210> 181

<211> 10

<212> DNA

<213> Homo sapiens

<400> 181

tagagaggat 10



<210> 182 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 182 

gttgccgttt 10 



<210> 183 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 183 

tgctgaagtt 10 

<210> 184 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 184 

gatgtttcca 10 
<210> 185

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 185 

aaggtatgat 10 



<210> 186 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 186 

tattttagat 10 



<210> 187 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 187 

ggctagttac 10 



<210> 188 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 188 

agttaattgc 10 



<210> 189 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 189 

ctgagagtga 10 



<210> 190 
<211> 10 
<212> DNA 
<213> Homo sapiens
<400> 190

aataagacac 10



<210> 191

<211> 9

<212> DNA

<213> Homo sapiens

<400> 191

atggcattg 9



<210> 192 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 192 

ttatattcat 10 



<210> 193 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 193 

tgaaaacagc 10 

<210> 194 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 194 

tgccgctgca 10 

<210> 195 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 195 
tctaaagaag 10



<210> 196 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 196 

aacctgatgg 10 



<210> 197 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 197 

gtatcagaag 10 



<210> 198 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 198 

tctaaaaata 10 



<210> 199 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 199 

gtacggcgga aactcacagc 20 



<210> 200 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 200 

gcacagcaac ttgttcattg tgg 23 
<210> 201

<211> 20 

<212> DNA

<213> Homo sapiens 

<400> 201 

tctcccaggg gacagacacc 20 



<210> 202 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 202 

taaaaggaca ccctagggga agc 23



<210> 203 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 203 

atcctggagc tttctgcaca cc 22



<210> 204 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 204 

ctgaatctaa gggaccactt ctttgc 26



<210> 205 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 205 

gctaagatga tgccactgtg ttcc 24 



<210> 206 
<211> 23 
<212> DNA 
<213> Homo sapiens

<400> 206

gccctgtagt ggcaaagtat tcc 23



<210> 207 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 207 

cctttcctct cacatttcat gagc 24

<210> 208 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 208 

caaacttgac actgcccttt gc 22 

<210> 209 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 209 

acacgtaagc tcagttggtc tctgc 25

<210> 210 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 210 

gagcaggggt gagaacaaaa cc 22 

<210> 211 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 211 
cttccaaagt tttgttggat gcc 23



<210> 212 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 212 

agtttactca tccatgggat ctcg 24 



<210> 213 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 213 

cccagcgaga ggacaaaggt 20 



<210> 214 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 214 

ggaaacctca ggggcttgg 19



<210> 215 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 215 

cccagccctg aaagcgttaa t 21 



<210> 216 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 216 

aaagcggttt cctgctccag 20

<210> 217

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 217 

ccgcccaagc ttcctaaaaa 20 



<210> 218 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 218 

tctccctctc tctcgctgtc tg 22 



<210> 219 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 219 

agtggaccac ccaggaaacg 20 



<210> 220 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 220 

gcaatggtcc tttggaagca 20 



<210> 221 
<211> 20
<212> DNA

<213> Homo sapiens
<400> 221 

aactggcaaa ggggatgatg 20 



<210> 222 
<211> 22 
<212> DNA 
<213> Homo sapiens
<400> 222 

gcatcgagag tagccttagc tg 22 



<210> 223 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 223 

accagccaac agaagcttga 20 



<210> 224 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 224 

gtccaacaat gattccaaca gg 22 



<210> 225 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 225 

ggtgtcactt aactccctct ca 22 



<210> 226 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 226 

ctgattggac ctggttacct 20



<210> 227 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 227 
ctgtaaacaa tttcagtggc aacccg 26



<210> 228 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 228

ccgtgctatt ctgcattcac tccttg 26 



<210> 229 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 229 

cgtaggaagc tccgatacca atagcc 26 



<210> 230 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 230 

ggagagcagg ggaaaaaaaa gcc 23 



<210> 231 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 231 

ttgaggtttc agaacccgaa gtgaag 26 



<210> 232 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 232 

caacaggaag tatcgcctgc ctttg 25 
<210> 233
<211> 26 
<212> DNA 
<213> Homo sapiens 

<400> 233 

gttaatcctg gagctttctg cacacc 26 



<210> 234 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 234 

tcgctgtctg tctctctctt gctgtc 26 



<210> 235 
<211> 24
<212> DNA 
<213> Homo sapiens 

<400> 235 

tcatgctaag atgatgccac tgtg 24 



<210> 236 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 236 

gcaaagtatt cctctgagca atgg 24 



<210> 237 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 237 

ctgtgttaag agggcatctg ctg 23 



<210> 238 
<211> 22 
<212> DNA 
<213> Homo sapiens

<400> 238

ttgaccaaga atgtggctgg ag 22



<210> 239 

<211> 25 

<212> DNA

<213> Homo sapiens 

<400> 239 

gccactaaga ccagccaaca gaagc 25 

<210> 240 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 240 

gagcaggggt gagaacaaaa ccttg 25 

<210> 241 

<211> 26

<212> DNA 

<213> Homo sapiens 

<400> 241 

ggctgtgtgt ctcctttagt tcctcg 26 

<210> 242 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 242 

ctgacagttg gattaaccat ttgggg 26 

<210> 243 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 243 
tgggaaagtg gtacgtcttt gagtgc 26



<210> 244 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 244 

ttgacctctg tgaaaacagc gtgc 24 



<210> 245 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 245 

ctgaaagcct caagtgcctg agaaac 26 